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Preface to the Fourth Edition 


This fourth edition, like the first three, addresses the organic 
disorders of voice due to laryngeal structural changes and 
neurologic diseases, as well as the psychogenic voice disor- 
ders due to environmental stress and psychoneuroses and 
muscle tension dysphonias. This edition of Clinical Voice Dis- 
orders differs from previous editions by both the expansion 
and addition of chapters. The chapters are updated with cur- 
rent references and augmented with extended content. A 
DVD-R has been added with supplementary chapters, demon- 
stration therapy, and interviewing. More specifically, organic 
evolution has been considerably expanded to include the 
photographs and concepts fleshed out by the late Jan Wind, 
PhD, in his seminal work On the Ontogeny and the Phylogeny 
of the Human Larynx (1970, Wolters-Noordhoff; out of print). 

Embryology of the larynx has been amplified, along with 
embryology of the respiratory system, and expanded to 
include many more detailed drawings illustrating the embry- 
ologic stages of both systems. Anatomy and physiology of voice 
production was revised by Susan L Thibeault, PhD, to include 
information on the microstructure of the vocal fold. Chapters 
on spasmodic dysphonias have been combined, updated, and 
revised. A new chapter, by Brian E. Petty, on the singer’s voice 
that addresses special issues related to the treatment of 
voice disorders in this population has been added. Psychogenic 
voice disorders now has extended case studies that relate per- 
sonal stories behind this etiology of dysphonias and aphonias. 
The chapter on examination of voice disorders has been 
rewritten by Nathan V. Wellham, PhD. Greater descriptions of 
more voice therapy techniques and their underlying principles 
also have been included in this edition. 

In this work I reassert that the study of clinical voice 
disorders, and their diagnosis and treatment is now more than 
ever a medical subspecialty that requires highly specialized 
education and training of the speech language pathologist in 
order to interface with other members of the health care team. 
I now believe more firmly than ever that possession of an 
abnormal voice is either a sign of illness or an illness in and of 
itself. People with voice disorders are suffering from an illness 
in every medical sense of the word, and a patient who presents 
himself or herself to the physician or speech language pathol- 
ogist can be a major challenge to clinicians who are untrained 
in the differential diagnosis of abnormal voice signs and symp- 
toms. The analysis and interpretation of abnormal voice is such 
an esoteric subspecialty that it requires extensive experience 
with hundreds of patients before the clinician is able to pick up 
the nuances of patient responses to questions posed about 
their conditions and acoustic-perceptual shadings of voice 
produced by specific disease states. 


I believe that it is impossible to achieve such expertise 
from reading a book or listening to lectures. What is neces- 
sary is direct observation and practical contact with patients 
under the supervision of someone with extensive experience 
with voice disorders. 

As in previous editions of this book, I prepare the student 
in the gross anatomy, neuroanatomy, and physiology of the 
respiratory, phonatory, and velopharyngeal mechanisms. My 
objective is to address the needs of students who will one 
day find themselves in clinical voice situations where they 
will be responsible for answering the following diagnostic 
and therapeutic questions. 


1. What are the medical reasons responsible for this abnor- 
mal voice? 


2. How can | examine this patient in a way that will lead to 
an answer to this all important question? 


3. What kinds of information will I have to know in order to 
obtain a full understanding of this patient and his or her 
voice disorder? 


4. What do I need to know in order to reduce or eliminate 
the abnormal voice? 


In this book I will try to show how abnormal voices are 
important to the physician’s medical diagnosis and treatment, 
as well as to the speech-language pathologist’s assessment and 
treatment plan. I will try to address the speech language 
pathologist’s role. There are highly practical reasons for the 
medical speech language pathologist in a medical setting to 
have a central role in the process, because he or she is often the 
person best trained to relate the abnormal voice to the struc- 
tural problem or neurologic disease and thus serves as the 
mortar between the specialities. The etiologic diagnosis and 
therapy for voice disorders is a highly demanding subspecialty 
that requires considerable talent and skill honed by years of 
experience. And “you can’t teach experience.” An internship or 
residency in departments of otolaryngology and neurology 
would be an ideal exposure of students to the full spectrum of 
medical voice disorders at the elbow of a consulting physician. 

In this book I will try to pass on to the student whatever 
pearls of wisdom I have accumulated during my nearly 50 
years of experience with voice disorders in medicine. 

From my practice five seminal ideas have emerged: (1) 
your voice is your identity; (2) when it becomes abnormal, 
the disorder is tantamount to an illness as devastating as any 
other illness; (3) dysphonia can be a sign of an organic or 
emotional illness, or a combination of the two; (4) the voice 
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is complex; and (5) the study of voice is inseparable from the 
practice of medicine. 


1. Your voice is your identity. Of the many meanings of voice 
in our lives, perhaps the most important is that it lies at 
the very heart of our personal identity—as indelibly 
implanted in our consciousness as our faces. One of our 
patients who had been placed on voice rest after laryn- 
geal surgery complained poignantly that her life had 
become intolerable because she was unable to express 
her feelings or thoughts to her family. “My voice is me,” 
she lamented. The catastrophic effects of abnormal voice 
on the life of the patient is a recurring theme in adult 
voice disorders practice. I dramatize this fact in the auto- 
biographical expressions of a 75-year-old woman with 
adductor spasmodic dysphonia (see “Confessions of an 
Adductor Spasmodic Dysphonic” Chapter 6). 


2. When it becomes abnormal, the disorder is tantamount to 
an illness as devastating as any other illness. Early text- 
books on voice disorders centered upon abnormal voice 
as an aesthetic aberration, primarily, and not as a barrier 
to intelligibility, or a sign of organic disease, or psycho- 
logic disequilibrium, or as a diagnostic window that could 
be used to ascertain the neurologic, psychologic, general, 
or laryngologic health of the patient. Early textbooks had 
no information about the impact of voice disorders on the 
quality of life, on choice of occupations, or on social rela- 
tionships. From its infancy, the field of voice disorders has 
matured into a rich cache of knowledge and lore about 
the supernormal, normal, and abnormal voice. The 
importance of its loss now can be found in its devastating 
effects on the average speaker’s self image, as attested to 
by recent memoirs and scientific papers. 


3. Dysphonia as a sign. (A) As a sign of organic illness. Voice 
becomes abnormal when there are pathological changes of 
the vocal folds or surrounding structures due to primary 
focal laryngeal pathology, laryngeal changes due to 
endocrine, and/or changes in the innervation of the laryn- 
geal musculature secondary to neurological diseases. (B) 
As a sign of emotional illness. Voice becomes abnormal 
when there are emotional, psychological, psychiatric ill- 
nesses, or stress reflected in the voice organ. They are one 
of the most common causes of nonorganic voice disorders. 


4. Voice is complex. The study of this complex phenomenon of 
voice is termed vocology, a term coined by Ingo Titze, PhD. 
Vocology has evolved from the knowledge that voice pro- 
duction requires a wide variety of physiological activities to 
produce sounds ranging from valving pulmonary air needed 
for a soft whisper and the hush of a baby, to loud screams for 
help and the intricate control needed for singing a beautiful 
aria. Voice production is not merely a couple of folds of tis- 
sue set into vibration by a stream of air from the lungs. 
Rather, this vibratory source is an exquisitely constructed 
biomechanical histoarchitectural structure with specific 
properties that are intricately connected and affected by 
one’s internal physical and mental well-being. The study of 
voice now comes from a variety of disciplines including, but 
not limited to, communicative disorders, psychology, laryn- 
gology, bioengineering, molecular biology, theater and 
music, kinesiology, linguistics, physiology, neurology, and 


genetics. Although difficult to ascertain when the evolution 
of voice study began, it is clear that when the American 
Speech Language Hearing Association was initially formed 
over 75 years ago, voice was not a major focus. Vocology was 
nonexistent. Stuttering, speech development, and hearing 
impairment dominated early curricula. Times have changed. 


5. Clinical voice disorders are no longer speech pathology’s 
stepchild. It has grown up into a mature adult with strong 
ties to the medical specialties of otolaryngology, neurology, 
and psychiatry. Since the last edition of this book, our med- 
ical care system has continued to offer diagnoses and treat- 
ments of many different areas of human illness, including 
voice disorders. The American public now is coming to 
expect the best, most sophisticated care available for all of 
its ills. The percentage of speech-language pathologists in 
medical clinics and hospitals continues to grow at a steady 
pace. Clinicians in the schools increasingly see medically 
fragile students, who not too many years ago would have 
been seen only in a hospital setting. Because of this medical 
connection, I continue to believe in the importance of a 
solid education in the basic sciences of gross and neu- 
roanatomy and neurophysiology and in the clinical med- 
ical specialties of otolaryngology, neurology, psychiatry, 
and even internal medicine. 


Thus, it seems clearly evident that the study of voice disor- 
ders is inseparable from the practice of medicine, and that 
the SLP functions as an arm of medicine, providing differen- 
tial diagnostic information to the referring physician, as well 
as therapeutic decisions in the patient’s interest. From my 
perspective, of all the major areas of information, organic 
diseases and psychological dysfunctions are the two bodies 
of knowledge that I must master in order to practice the spe- 
cialty of clinical voice disorders competently. And that is 
what this book is all about—the organic and psychiatric 
infrastructure of normal and abnormal voice. 

It is also clearly evident to me that the theory and practice 
of voice disorders are squarely embedded in the cardinal 
medical specialties of internal medicine, pediatrics, neurol- 
ogy, psychiatry, and otolaryngology. The skilled clinician 
needs to have some knowledge and working vocabulary from 
these disciplines, be able to work with practitioners in these 
medical subspecialities, and be an astute observer of behavior. 
The skilled clinician must also be a scholar who keeps abreast 
of the ever-changing concepts in the field and in the knowledge 
of how the wonderful human body and mind work together to 
produce and correct disorders of voice production. 

I wish to express my gratitude to Dr. and Mrs. Muriel Kulwin 
for their permission to use their writings; to the late Jan Wind, 
MD, PhD, for his seminal work on the ontogeny and phylogeny 
of the human larynx, parallel in its brilliance to the research of 
Sir Victor Negus, MD, on the comparative anatomy and physi- 
ology of the human larynx; to our patients and medical col- 
leagues in the departments of neurology, psychiatry, and oto- 
laryngology; to the late Joe R. Brown, MD, Norman P. Goldstein, 
MD, Edward M. Litin, and Frederick L. Darley, PhD, of the Mayo 
Clinic; to my colleagues at the University of Wisconsin; and to 
Diane M. Bless, PhD, for helping edit the manuscript. 


Arnold E. Aronson, PhD 
Mayo Clinic, Rochester, Minnesota 
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Chapter 1 


Introduction to Clinical Voice Disorders 


It is impossible to know the fundamentals of a phenomenon 
without having solid knowledge of its origin, development 
and the chain of causes, conditions and circumstances 
determining its actual existence. 


—Kiml 


¢ The Functions of the Larynx in Living 
Creatures 


The larynx lies at the crossroads of life, a barometer of our 
physical and mental health, an airway through which flows 
life-sustaining oxygen, and a valve that protects the lungs 
from ingestion of foreign substances. When its powerful 
musculature shuts off the airway, air can be impounded in 
the lungs, forming a rigid thorax to support firmly the 
attached upper extremities during lifting and pushing. Bear- 
ing down during laryngeal closure compresses the abdomi- 
nal contents for defecation, micturition, and parturition and 
fixation of the thoracic cage for coughing, throat clearing, 
and vomiting. Yet, as vital as these functions are, the larynx 
manages to shift deftly from protector of life to communica- 
tor, generating raw sound for articulation of intellect and as 
a prosodic background to language informing the outside 
world of its owner’s personality, emotional state, and cul- 
tural heritage. It is a somewhat daunting task to attempt to 
write a book that provides the bases for voice assessment 
and treatment of this formidable organ. The mechanics of 
speech require integration of the respiratory, phonatory, res- 
onatory, and articulatory musculature. Although the empha- 
sis of this book is on voice production, we consider herein, 
for the most part, components of the total speech act: respi- 
ration, the power and driving force of sound; phonation, 
sound generated by rapid vocal fold movement driven by the 
exhaled air stream; and resonation, modification of the raw 


sound generated by the larynx by movement of the other 
articulatory structures including the velopharyngeal com- 
plex. 

The larynx is a complex organ composed of cartilage, mus- 
cles, nerves, and connective tissues. The vocal folds them- 
selves, the source of sound, are made of layers of laminated 
tissue, with a muscular layer in the innermost part of the 
folds and progressively thinner layers of softer tissues 
toward the epithelium. During vocalization, cells from the 
skin layer are destroyed and sloughed off. Vocalization 
causes cellular and extracellular damage to vocal fold tis- 
sues, which the body can repair with time. Factors such as 
genetic predisposition, internal and external vocal tissue 
environment, physical and mental health, and quantity and 
quality of vocal use are thought to contribute to the recovery 
rate of injured vocal fold tissue. 

An important and unusual fact about the larynx is that 
phylogenetically it is eons older than those brain and muscle 
structures responsible for articulation and language. The 
latter are synonymous with the intellect, itself a recent ver- 
tebrate acquisition, but the larynx and voice owe a dispro- 
portionately strong allegiance to the primitive emotions, 
persuasive proof of which can be found in everyday experi- 
ence, during emotional arousal. When we are incapable of 
preventing loss of cortical control over the larynx, it escapes 
our grasp and drags us back into its primeval depths as we 
laugh, cry, scream, groan, or are rendered voiceless. Have we 
not often witnessed such loss of intellectual speech expres- 
sion as in the following experience? A professor of law, 
known for his razor-sharp tongue and habit of taunting and 
ridiculing students (who, incidentally, loved him in spite of 
it), rose to speak at a farewell banquet: 


He stood motionless for a long time. We observed that his 
face was different than we had ever seen it before. It had 
lost its severity. It was flushed and looked pink and kind. 
The mouth was not a snarl. His voice, too, was different. It 
was soft without a trace of belligerence or sarcasm. “I can- 
not bid you all goodbye and leave unsaid that which is the 
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most important thing in my life.” He took a deep breath 
and continued, his voice hoarse with emotion. It grew 
hoarser as he struggled to eliminate the quiver, which 
was entering it. He then paid tribute to his lovely wife. “I 
cannot in this leavetaking do other than tell you that I owe 
all my happiness to her, that she has . . .” He turned. Tears 
were running down her cheeks. Their gazes met and locked 
in long silence. Then without another word, he sat down” 
(Nizer, 1978). 


We are fascinated by the mystery of how speech and lan- 
guage evolved in the kingdom of the vertebrates and have 
struggled with a basic question: Are the structure and sound- 
generating properties of the human larynx mere extensions 
of its role in lower vertebrates, the ultimate in evolutionary 
refinement for the expression of ideas? That is to say, is the 
Darwinian belief correct, that speech is not a product of spe- 
cial creation but only another form of sound-making, more 
sophisticated because of a quantitative increase in human 
intelligence? Or, is speech uniquely human, de novo, differ- 
ent in kind, not degree, from lower animals? The answer is 
unknown. 

Another, perhaps only slightly less difficult, question is 
this: Is it the monkey’s and chimpanzee’s intellect or anatomy 
that prevents speech in these primates? Some argue that the 
primitiveness of their laryngeal and articulatory anatomy 
stands in the way of speech, yet the larynges of the monkey 
and chimpanzee are remarkably similar in structure and 
method of sound production to those of humans. The fact is 
that human speech is not dependent upon a particularly 
refined larynx, for speech continues despite laryngeal tumor, 
paralysis, and even total laryngectomy, whereupon the 
esophagus or even an electronic sound generator can 
assume the responsibility for sound-making. Most scientists 
now concur that before primates could talk, a high level of 
abstract intelligence was required as the governing force. 

A universal fact about the world of living creatures is that 
almost all make sounds. Insects tap surfaces, snap or rub 
their wings, or rub leg against wing, as in the case of the 
grasshopper, producing complex trains of pulses that send 
messages of courtship and sexual recognition. Admittedly, 
many lower forms produce sound accidentally, but most do 
so for a purpose; survival-signaling fear, aggression, mating, 
territoriality, and pleasure. Situation-specific use of voice in 
birds, for example, has been firmly categorized into mating, 
distress, fear, anger, terrorizing, and triumphal calls. 

Negus (1929) observed that individual living creatures 
produce sound as both defense and offense: intimidation, 
cries for help and food, and decoying of prey. Sound brings 
and keeps the sexes together for survival of the species, 
attracting and repulsing the opposite sex, and protecting off- 
spring. Sound further enables each species to keep in touch 
when out of sight, as in grass, in trees, at night, and in lairs or 
burrows. Sound also aids in the conveyance of special ideas, 
such as cooperation, calls to food and migration, and enter- 
tainment. 

Human phonation is linked to much more than the intel- 
lectual act of speech. The human voice serves similar sublin- 
guistic purposes of survival. The larynx is an important 
escape valve for the emotions—anger, grief, and affection— 


which are essential to the maintenance of our psychologic 
equilibrium. 


¢ Definitions of Terms 


Voice 


We need to define terms used constantly in the field of voice 
disorders. To begin with, voice is an auditory perceptual term 
that means the audible sound produced by the larynx, which 
embodies such parameters as pitch, loudness, quality, and 
variability. 


+ Pitch is the perceptual correlate of frequency. 
Loudness is the perceptual correlate of intensity. 
Quality is the perceptual correlate of complexity. 


+ Variability is the perceptual correlate of variations of the 
above parameters. 


Phonation 


+ Phonation is the physical-physiologic act of sound pro- 
duction: the oscillations of the vocal folds driven by the 
exhaled air stream. Clinical terms denoting abnormal 
voice are dysphonia, aphonia, and muteness. 


+ Dysphonia describes any voice that sounds abnormal in 
its psychoacoustic parameters of pitch, loudness, quality, 
and variability; for example, excessively high or low pitch, 
inadequately or excessively loud voice, aberrant quality 
such as hoarseness, or breathiness or voices lacking in 
variability and as monopitch or in some patients exces- 
sive variability. 


+ Aphonia refers to the absence of a laryngeal tone: sounds 
like whispered speech or an extreme degree of breathi- 
ness. 


+ Muteness is a word to describe the patient who has no 
voice or articulation. 


Normal Voice 


Frequently asked is, what is a normal voice? Johnson, Brown, 
Curtis, et al. (1965) listed several criteria: 


+ Quality must be pleasant, with a certain musical quality 
and an absence of noise, inappropriate breaks, voice per- 
turbations, or atonality. 


+ Pitch must be appropriate to the age and gender of the 
speaker. 


Loudness appropriate to the communication event, not so 
weak as to be borderline intelligible or unintelligible 
under ordinary speaking circumstances, not so loud 
that it calls attention to itself, and appropriately 
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adjusted to the context of the event be it for confiden- 
tial communication, speaking in a large lecture hall, or 
yelling for help 


+ Adequate flexibility Pitch and loudness variations are 
available to express emphasis, meanings, or subtleties 
indicating individual feelings and semantic differences. 


+ Adequate sustainability Voice must meet one’s social 
and occupational needs even when extended voice is 
required. 


As we have become more sophisticated in our knowledge 
about “normal” voice production, we recognize that any nor- 
mal criteria must be adjusted for different age levels. 


Abnormal Voice 


By definition, an abnormal voice is any voice that calls atten- 
tion to itself, does not meet the occupational or social needs 
of the speaker, or is inappropriate to age, gender, or situation. 
Voices that attract unwanted attention are most often due to 
aberrant vocal qualities. 

Abnormal voice qualities are often subsumed under the 
term hoarseness, most often referring to voices that are 
noisy, atonal, or possess odd resonance patterns. A plethora 
of abnormal voice quality terms has been accumulated over 
time, not all of which are easily identified by listeners or 
agreed upon (e.g., aspirate, asthenic, breathy, choppy, coarse, 
dull, feeble, flat, gloomy, grating, grave, growling, guttural, 
harsh, hoarse, hollow, husky, infantile, lifeless, loud, metallic, 
monotonous, muffled, nasal, neurasthenic, passive, pectoral, 
pinched, rough, somber, strained, strident, subdued, thick, 
thin, throaty, tired, toneless, tremulous, tremorous, weak, 
whining, and whispered). 

Fairbanks (1960) tried to distill voice quality defects into 
three categories: harshness, breathiness, and hoarseness. 
The validity and reliability of these terms, however, were 
questioned in a study by Jensen (1965). He asked six experi- 
enced speech-language pathologists to rate the voices of 
cheerleaders who had different degrees and types of dys- 
phonia and found disagreement and inconsistency among 
their ratings. What certain judges heard as “breathiness” 
others described as “hoarseness,” whereas still others 
thought “harshness” best applied. In addition, estimations of 
voice quality were contaminated by severity. The study 
unearthed a common problem in clinical practice: individual 
voices were inconsistent throughout the sample, embodying 
more than one type of quality. The reliability of listeners’ rat- 
ings of voice quality is a central issue in voice research labo- 
ratories and the clinic because of the clinical primacy of such 
ratings and because they are the standard against which 
other measures are evaluated. An extensive literature review 
reported by Kreiman, Gerratt, Kempster, et al. (1993) clearly 
documents that both intrarater and interrater reliability fluc- 
tuate greatly from study to study. They went on to demon- 
strate with data collected in their laboratory that even 
within individual clinicians, a single voice often received 
nearly the full range of possible ratings. 

Kent (1996) attempted to define some limits to the audi- 
tory perceptual assessment of several communication prob- 


lems including voice disorders. In this review article, he 
focuses on the variety of sources of error and bias that 
clinicians are susceptible to make in auditory perceptual 
judgments. He suggests that awareness of these threats to 
validity and reliability moves us toward being more clinically 
effective and refining use of perceptual methods. A major 
factor is that the assumptions on which clinical judgments 
are predicated are often false. Listeners do not necessarily 
have a common understanding of perceptual labels, use the 
same verbal descriptors or associated scale values to assess a 
given voice, have the ability to isolate judgment of one per- 
ceptual dimension from co-occurring dimensions, have 
uniform reliability in judging voice dimensions, or have the 
ability to make perceptual judgments for which the inter- 
judge differences are smaller than the differences needed for 
clinical classification or to detect changes in clinical status. 
This lack of defined terminology, limitless variety of voice 
quality, lack of reliability, and difficulties in determining 
specificity and sensitivity led some international voice com- 
munities, namely the Japanese Society of Logopedics and 
Phoniatrics, the European Research Group on the Larynx, the 
United Kingdom British Voice Association, and the 2002 
ASHA Auditory Perceptual Judgment Consensus Conference, 
to independently reach similar conclusions concerning per- 
ceptual judgments of voice. They have suggested minimal 
standards and development of criterion-based training 
materials such as those routinely used in training speech- 
language pathologists in Australia (Oates and Russell, 1998). 
These issues will be discussed in some detail later in the 
book and the DVD accompanying the book. 

Thus, what Jensen, Kreiman, and colleagues concluded 
experimentally is substantiated clinically that the traditional 
labels used to describe voice quality deviations must be viewed 
with skepticism; one clinician's hoarseness is another’s breath- 
iness or harshness. To keep a proper perspective, however, it 
should be noted that the incongruence within terminology to 
describe the deviant voice is not as critical as it might seem 
from research studies and international consensus groups on 
voice perception. More important than terminology is the clini- 
cian’s ability to make similar judgments from test-to-test, to 
integrate perceptual, acoustic, physiologic, and psychologic 
dimensions of abnormal voice, to interpret the abnormal sound 
in light of other assessment data to determine its cause, and to 
develop a sensitivity to nuances of voice during therapy. 


+ Abnormal pitch Pitch is the perceptual counterpart of 
fundamental voice frequency. Disorders of pitch refer to 
abnormally high or low voices. 


+ Abnormal loudness Loudness is the perceptual correlate 
of the physical dimension of vocal intensity. The voice 
may be too weak or too loud. 


+ Abnormal flexibility The normal voice possesses adequate 
pitch, loudness, and quality variability during contextual 
speech to convey more subtle, intellectual, and emotional 
meanings. In voice disorders, those fluctuations may be 
either inappropriately flattened, excessive, or inconsistent. 


Rarely in clinical practice does abnormal voice vary along 
a single dimension of quality, loudness, pitch, or flexibility 
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(Hirano, 1981). Most of the time, even though one may pre- 
dominate, they are usually present in different combinations 
and proportions making it often difficult for clinicians to 
describe the abnormality. 


Abnormal Voice as a Sign of Illness 


In medicine, as well as in speech-language pathology, we 
need to distinguish between the words sign and symptom. A 
sign is a physical manifestation of an illness (e.g., hoarse- 
ness, diplophonia, or unilateral vocal fold paralysis are 
signs). The patient’s complaints of abnormal voice, pain, or 
discomfort are symptoms. Signs are objective findings, 
symptoms are the patient’s subjective interpretation of 
what is physically occurring. Faced with someone whose 
voice sounds abnormal, the clinician’s chief concern should 
be whether or not the abnormal signs and symptoms of a 
voice problem signify illness. Communicative or aesthetic 
considerations are, for the moment, secondary though in 
the long run no less important. From the laryngology syl- 
labus of a medical school curriculum comes the following 
statement pertaining to the meaning of abnormal voice in 
medical practice: 


Dysphonia is not a disease but rather a sign or symptom of 
disease of the larynx itself or along the course of the laryn- 
geal motor nerves. Thus, hoarseness is the cardinal sign of 
laryngeal involvement, often the first and only signal of 
dangerous disease, local or systemic, involving this area.... 
A thorough examination is necessary in all cases to ascer- 
tain the exact cause and prescribe the proper treatment.... 
The differential diagnosis may tax the ingenuity of the 
most exacting physician and speech pathologist since 
many different medical, surgical and non-organic condi- 
tions may cause disturbing dysphonia. 


The cause or causes of the abnormal voices need to be 
established, if at all possible. A breathy voice quality that 
appears gradually and increases in severity may have only 
minor social, communicative, or aesthetic significance yet 
may announce the onset of a brain-stem tumor. Once the 
reason for the voice disorder is known and has been elimi- 
nated as a threat to the life of the individual, the commu- 
nicative significance of the voice can then be considered. The 
modern speech clinician must be educated along medical as 
well as rehabilitative lines, however. Working with physi- 
cians, speech-language pathologists can contribute to the 
medical diagnosis of voice disorders as well as to their treat- 
ment though it must be underscored that the actual medical 
diagnosis is made by the physician. 

Often, determining the etiology must accompany early cor- 
rective procedures frequently referred to as facilitating 
approaches or trial therapy; in many cases, necessary diagnos- 
tic information can be gained only by testing (the patient’s) 
response to these rehabilitory measures; but in every case... 
a thoroughgoing investigation as to the cause of the defect... 
should be reopened (West, Kennedy, and Carr, 1947) before a 
decision is made regarding corrective devices to be employed. 
A search for the cause, or causes, should not be ended until the 
basic etiology has been uncovered; and, if at any time... it 
appears that a reappraisal of its origins would give a clearer 


picture of its nature, the search for the ultimate cause should 
be reopened (West, Kennedy, and Carr, 1947). 


Abnormal Voice as a Symptom of Illness 


Used properly, symptom refers to the patient’s subjective 
complaint, real or imagined; whether or not the clinician 
thinks the voice is abnormal is independent of the patient’s 
beliefs. Three variations on this theme occur in practice: 


1. The voice is judged defective by both clinician and patient, 
and both advocate a need for its investigation and therapy. 
Such mutual agreement is ideal for maximum coopera- 
tion in clinical diagnosis and therapy. Both parties realize 
that something needs to be done about the abnormal 
voice and assuming patient-clinician treatment goals are 
matched proceed with optimal effectiveness. 


2. The clinician is convinced of a need for voice investigation 
and therapy, but the patient is not. This situation arises 
from either (a) the clinician's unrealistic, overdetermined 
definition of abnormal voice and overemphasis on voice 
improvement or (b) the patient’s indifference to or denial 
of a genuine problem. In either instance, covert or even 
overt disagreement between patient and clinician ensues, 
and efforts at diagnosis or therapy are met with patient 
disinterest, resistance, or even hostility. 


3. The patient’s conviction that a voice disorder exists despite 
the fact that clinicians are unable to detect a major focus for 
concern. Such conflicts may be a sign of patient overreac- 
tion, a sequel to recovery from laryngologic disease or 
laryngeal surgery, or, in the case of singers, a clinician’s 
inability to detect the changes. Overreactions of patients 
are viewed as expressions of hostility, perfectionism, or 
depression, requiring psychologic assistance. It is also 
possible that the patient’s voice complaint only demon- 
strates itself under periods of stress or fatigue or is related 
more to vocal effort than vocal quality making its clinical 
description challenging. 


Abnormal Voice as a Disorder of Communication 


In addition to abnormal voice as an index of health or illness, 
it is also valued as an instrument of communication. Within 
this framework, the following questions are pertinent: (1) 
Is the voice adequate to carry language intelligibly to the lis- 
tener? (2) Are its acoustic properties aesthetically accept- 
able? (3) Does it satisfy its owner’s occupational and social 
requirements? Voice, in other words, has personal, social, 
and economic significance. This has led recently to surgical 
treatment of the larynx to produce a younger, fuller voice in 
aged individuals. The higher one ascends the socioeconomic 
scale, the greater the emphasis placed on pleasant, effective 
voices. With few exceptions, the greater the dependence on 
voice for occupational and social gratification, the more dev- 
astating the effects of a voice disorder on the person. Imag- 
ine the consequences of being a singer, actor, politician, 
clergy member, teacher, or salesperson and not having full 
voice at your command. The consequences of even a rela- 
tively minor voice defect can be devastating. 
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¢ Voice Disorder 


An abnormal-sounding voice does not always provoke indi- 
viduals to seek the services of a laryngologist or speech-lan- 
guage pathologist. Although a voice disorder exists when 
quality, pitch, loudness, or flexibility differs from the voices 
of others of similar age, sex, and cultural group, no fixed, 
uniform standard of abnormal voice exists, just as no 
absolute criterion for normal voice can be established. Con- 
sequently, a wide range of voices accepted as normal differs 
based on occupational needs and cultural expectations, and 
many dysphonic individuals are not even aware they have a 
voice problem. Those individuals who do seek professional 
help come for a variety of reasons. Some patients request 
voice services because they have noticed increased effort to 
sing or speak even though the voice “sounds normal” for 
most purposes. For others, it is a change in voice and fear of 
laryngeal cancer that causes them to seek help. Even though 
the voice may not have been a good one previously, it is 
appropriately alarming that it has changed. Others seek 
help because their voice fatigues, cannot project, or does not 
meet their vocal needs at home or work. Although layper- 
sons often pay little attention to the specific acoustic param- 
eters of voice that have caught their attention, clinicians and 
laboratory researchers are more analytic. This can lead to 
differences in therapeutic goals if the clinician is not careful 
to identify what brought the patient to the clinic in the first 
place. The layperson is often more interested in easy, effort- 
less phonation than a good quality; a singer may want to 
improve range without reducing vocal identity; a smoker 
with a rough quality may merely want to be assured he does 
not have laryngeal cancer. 


It follows that the perceived defectiveness of any one voice 
will vary among listeners. It is apparent that the voice is 
abnormal for a particular individual when he or she judges 
it to be so regardless of the circumstances. Judgment 
implies a set of standards that are learned through experi- 
ence and that are related to the judge’s own aesthetic and 
cultural criteria. Judgment also implies that standards are 
not fixed, that there is opportunity for more than one con- 
clusion. This flexibility in determining the defectiveness of 
voices does not alter the validity of the basic definition of 
voice disorders, but it does underscore the observation that 
vocal standards are culturally based and environmentally 
determined (Moore, 1971b). 


The main implication of this statement is that the defini- 
tion of a given voice as normal or abnormal depends upon 
the orientation of the person making the judgment. Child, 
parent, adult, employer, speech-language pathologist, and 
laryngologist all define normal and abnormal according to 
their own needs and backgrounds, an important point for 
the speech-language pathologist to remember, whose judg- 
ments about voice must be adjusted according to the pur- 
poses of the evaluation. The Voice Handicapped Index (VHI) 
recently developed by Jacobson and Benninger helps clini- 
cians understand the patient’s perspective and plan treat- 
ment accordingly. 


Perkins (1971) lists five kinds of information that can be 
extracted from the voice. It is an indicator of the speaker’s (1) 
physical health, (2) emotional health, (3) personality, (4) 
identity, and (5) aesthetic orientation. It is also (6) a carrier 
of connotative and denotative content. This list is important 
in that it tells us that voice has many meanings for both 
speaker and clinician and is a rich storehouse of clues to 
understanding the individual. It also underscores the impor- 
tance of taking a careful case history to obtain the informa- 
tion necessary to understanding how the voice problem 
relates to the individual and choice of treatment. 


Classifications 
Etiologic 


Etiology means the underlying cause or hypothesis that 
explains signs and symptoms. Because such a classification 
encourages the deepest understanding of dysphonia or 
aphonia, it will be the primary category used in this book. 
For example, a neurologic disease, such as trauma to one 
recurrent laryngeal nerve, is the etiology of a voice disorder. 
It causes a pathophysiologic condition, unilateral vocal fold 
paralysis, which in turn causes the abnormal voice signs of 
breathiness, hoarseness, reduced loudness, diplophonia, and 
reduced variability (Moore, 1971a). 

Etiologic diagnosis of voice disorders is imperative when 
speech-language pathology is practiced within the frame- 
work of medicine. The teaching that voice signs, such as 
breathiness and hoarseness, are “disorders” in and of them- 
selves is contrary to the purposes of educating one to under- 
stand the individual behind the abnormal voice. Attempts to 
change the voice alone without understanding its etiology 
and concomitant maintenance factors are the most common 
reasons for failure in diagnosis and therapy, especially 
within the realm of nonorganic voice signs and symptoms. 


Perceptual 


Voice disorders can be classified according to their acoustic 
perceptual attributes: quality, pitch, loudness, and flexibility. 
Although this system has the advantage of emphasizing the 
voice characteristics perceived by the listener, if used alone it 
has the disadvantage of failing to provide sufficient informa- 
tion about the underlying causes and muscular dynamics 
responsible for the abnormal voice. 


Kinesiologic 


Exemplified by the terms vocal hyperfunction and vocal hypo- 
function, this classification categorizes voice disorders 
according to whether the vocal folds overadduct or under- 
adduct, thus closing the glottis too tightly or incompletely. 
Although not without merit, this classification, if used exclu- 
sively, oversimplifies the complexities of laryngeal diseases, 
placing excess emphasis on the degree of approximation of 
the glottal margins rather than on the multiple causes of such 
approximation defects. Furthermore, it neglects to recognize 
that many persons with so-called hypofunctional problems 
compensate by overadduction of the supraglottal structures. 
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Table 1.1 List of Some Etiologies of Organic Voice Disorders 


Congenital Disorders 
Atresia 
Cri du chat 
Laryngomalacia 
Subglottic stenosis 
Laryngeal web 
Laryngeal cleft 
Papilloma 
Laryngocele 
Mongolism 
Laryngeal saccule 
Lymphangioma 
Subglottic hemangioma; ectopic thyroid gland 
Inflammation 
Laryngitis 
Tuberculosis 
Fungal laryngotracheobronchitis 
Epiglottitis 
Streptococcus 
Staphylococcus 
Scarlet fever 
Pertussis 
Typhoid fever 
Histoplasmosis 
Blastomycosis 
Reflux esophagitis 
Metabolic Disorders 
Hypothyroidism 
Hyperthyroidism 
Amyloidosis 
Lipoid proteinosis 
Wegener’s granulomatosis 
Rheumatoid arthritis 
Benign Neoplasms 
Papilloma 
Chondroma 
Fibroma 
Cysts 


Etiology 
Organic Voice Disorders 


A voice disorder is organic if it is caused by structural 
(anatomic) or physiologic disease, either a disease of the lar- 
ynx itself or by remote systemic or neurologic diseases that 
alter laryngeal structure or function. Table 1.1 lists various 
causes of organic voice disorders. 


Psychogenic Voice Disorders 


Common synonyms for psychogenic are functional and 
nonorganic. Psychogenic voice disorders include disorders of 
quality, pitch, loudness, and flexibility caused by psychoneu- 
roses, personality disorders, or faulty habits of voice usage 
(Table 1.2). The voice is abnormal despite normal laryngeal 
anatomy and physiology. 


Malignant Neoplasms 
Squamous cell carcinoma 
Adenocarcinoma 
Fibrosarcoma 
Lymphoma 
Neurofibrosarcoma 
Malignant melanoma 
Neurolemmoma 
Spindle cell carcinoma 
Trauma 
Contact ulcer 
Vocal nodule 
Postintubation granuloma 
Mucosal burns 
Sulcus Vocalis 
Presbylarynges 
Neurologic Disorders 
Bilateral upper motor neuron (spastic) disorder 
Mixed lower-upper motor neuron (flaccid-spastic) disorder 
Basal ganglia (hypokineticparkinsonism) disorder 
Essential tremor 
Cerebellar (ataxic) disorder 
Basal ganglia (hyperkineticchorea) disorder 
Basal ganglia (hyperkineticdystonia) disorder 
Brain-stem (palatopharyngolaryngeal myoclonus) disorder 
Brain-stem (organic voice tremor) disorder 
Gilles de la Tourette’s disease 
Apraxia of phonation 
Akinetic mutism 
Foreign dialect syndrome 
Jugular foramen syndrome 
Adductor spasmodic dysphonia, essential tremor type 
Adductor spasmodic dysphonia, dystonia type 
Abductor spasmodic dysphonia, essential tremor type 
Abductor spasmodic dysphonia, dystonia type 


Table 1.2 List of Etiologies of Psychogenic Voice Disorders 
Emotional Stress-Musculoskeletal Tension 


Voice disorders without secondary laryngeal pathology 
Voice disorders with secondary laryngeal pathology 
Vocal abuse 

Vocal nodule 

Contact ulcer 

Vocal fatigue 


Psychoneurosis 


Conversion disorder 

Mutism 

Aphonia 

Muscle tension dysphonia 

Psychosexual conflict 

Mutational falsetto (puberphonia) 

Dysphonia associated with conflict of sex identification. 
latrogenic dysphonia 
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Voice Disorders of Multiple Etiology 


Voice disorders are complex and often do not have a single eti- 
ology. For example, spasmodic dysphonia, including adductor, 
abductor, and mixed adductor-abductor types, can be neuro- 
logic, psychogenic, or of unknown (idiopathic) etiology. Per- 
sons with a clear organic etiology may be so depressed by 
their dysphonia that the depression compounds the voice 
problem becoming a clear secondary etiology. Still other indi- 
viduals with voice problems may present with compensatory 
strategies developed from an organic problem, such as laryn- 
gitis that has long since resolved, and often secondary to 
anatomic changes from laryngeal or other surgeries. 


Incidence 


The National Institute of Deafness and Communicative Disor- 
ders, a division of the National Institutes of Health, estimates 
7.5 million individuals have diseases or disorders of the voice 
caused by overuse of the vocal folds, upper respiratory infec- 
tions, vocal fold lesions, laryngeal cancer, and other laryngeal 
pathologies (American Speech Language Hearing Association, 
2002). Differences in vocal expectations (occupational, social, 
and cultural) make it difficult to determine how these inci- 
dence figures relate to such factors as age, vocal use, and 
occupation. Interpretation of incidence figures is further 
complicated by differences in methodology and definitions 
used by investigators as to what constitutes a voice disorder. 
Thus, it is not surprising to see estimates of voice disorder in 
total population ranging from 3 to 9% (Ramig and Verdolini, 
1998), in children ranging from 3 to 24% (Senturia and Wil- 
son, 1968; Silverman and Zimmer, 1975; Yairi, Currin, Bulian, 
et al. 1974). Koufman and Blalock (1991) suggest at least 10% 
of these are functional in origin, but here, too, the estimates 
range widely. We will not try to rectify the reported differ- 
ences but rather present some representative studies. 

Statistics on the incidence of voice disorders in school-age 
children suggest most children with voice disorders have 
dysphonia related to vocal abuse, either with or without 
resultant vocal fold nodules or general inflammation (Cooper, 
1973; Pannbacker, 1999). In contrast with the studies on 
adults, the studies on children have been drawn from large 
bases. For example, Senturia and Wilson’s (1968) study of 
32,500 school-aged children in St. Louis, Missouri, demon- 
strated 6% had voice disorders. Other studies report a some- 
what higher level. Yairi, Currin, Bulian, et al. (1974), based on 
a study of 1500 school-age children, found an incidence of 
hoarseness in 13%. The highest incidence was reported by Sil- 
verman and Zimmer (1975), who found voice disorders in 
23.4% of schoolchildren. Whether these differences are 
related to actual differences in the children tested or criteria 
for voice disorder could not be determined. 

Many of the studies reported for adults have been done in 
voice centers or specific occupational settings and the num- 
bers extrapolated to the general population. One of the few 
incidence studies of adults, performed by Laguaite (1972), 
found that of 428 patients aged 18 to 82 years, 7.2% of the 
males and 5% of the females had voice disorders. In his oto- 
laryngologic practice, Brodnitz (1971), reporting on only 
“functional” voice disorders, found that in 1851 cases, 25.8% 


had hyperfunctional (musculoskeletal tension) voice disor- 
ders, 19.7% had polyps, 15.3% had vocal nodules, 9.4% had 
polypoid thickening, 5.3% had contact ulcers, 4.7% had muta- 
tional voice disorders, 4.7% had spastic dysphonia, 4.4% had 
psychogenic aphonia, and the remaining patients had voice 
disorders from other causes less common. 

Representative statistics on the incidence, or prevalence, 
of abnormal voice in otolaryngologic practice are meager. In 
a study of the prevalence of laryngeal pathologies in medi- 
cine according to sex, age, and occupation, Herrington-Hall, 
Lee, Stemple, et al. (1988) investigated 1262 patients drawn 
from several otolaryngologic practices. They found that the 
most common disorders were vocal nodules, 21.6%; edema, 
14.1%; polyps, 11.4%; cancer, 9.7%; vocal fold paralysis, 8.1%; 
and dysphonia without laryngeal pathology, 7.9%. Laryngeal 
pathologies occurred mostly in the older age groups; 57% of 
patients were over 45 years of age, with 22.4% over age 64 
years. In a study by Dobres, Lee, Stemple, et al. (1990), the 
most common disorders in children were subglottal tracheal 
stenosis, vocal nodules, laryngomalacia, nonorganic voice 
disorders, and vocal fold paralysis. Kaufman and colleagues 
have reported several studies on the prevalence of the spec- 
trum of dysphonia (Koufman, Blalock, 1991; Koufman and 
Isaacson, 1991; Koufman, 1991), including one one ocupa- 
tions and pathology in a otolaryngology practice. She also 
looked at occupation and pathology in an otolaryngology 
practice. She divided her adult clinical population into four 
primary conditions and found that the total added up to 
more than 100% because many patients exhibit more than 
one problem. She reported that the Wake Forest Voice Center 
population broke down into 100% infectious and inflamma- 
tory conditions, 42% vocal misuse/abuse syndromes, 39% 
benign and malignant growths, 17% neuromuscular disease, 
and 3% psychogenic disorders. Clearly, laryngeal diseases do 
not occur with equal frequency across age and gender. 
Between 60% and 80% of papillomatosis occurs in children 
before the age of 3 years. Spastic dysphonia usually appears 
between 30 and 50 years of age in females. Nodules occur 
frequently in children and young adult women but rarely 
occur in adult males. According to Dobres, Lee, Stemple, el al. 
(1990), vocal nodules and edema were most common in 
early adulthood (22 to 44 years); polyps and dysphonia, 
despite a normal laryngeal examination, were most common 
during middle adulthood (45 to 64 years); and vocal fold 
paralysis in late adulthood (over age 64 years). Cancer was 
evenly distributed between the ages of 45 to 64 and over age 
64 years. In males, vocal nodules were most common 
between 0 and 14 years, edema between 25 and 44 years, 
polyps between 45 and 64 years, and vocal fold paralysis pri- 
marily after age 64 years. 

No summary of the incidence and prevalence of voice dis- 
orders would be complete without some information on 
how the figures relate to occupation. In the total working 
population in the United States, ~25% have jobs that criti- 
cally require voice use, and 3% of the population have occu- 
pations in which their voice is necessary for public safety 
(National Institute on Deafness and Other Communication 
Disorders, 1999). Thus, it is not surprising in clinical practice 
to find certain occupations where vocal abuse, emotional 
stress, and ambient noise are prevalent and individuals 
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prone to develop voice disorders (e.g., classroom teachers, 
actors, singers, telemarketers, auctioneers, factory workers, 
athletic coaches, and the clergy). A landmark study by Titze, 
Lemke, and Montequin (1997) compared U.S. workers who 
rely on voice to fulfill professional tasks with their appearance 
in voice centers. Teachers comprise ~20% of the patient load in 
voice clinics though they only constitute 4% of the workforce. 
Sapir, Keidar, and Mathers-Schmidt (1993) and Smith, Lemke, 
Taylor, et al. (1998), researching voice problems in teachers, 
support the notion that the incidence of voice problems in 
teachers is high. These investigators surveyed teachers and 
found that one third to one half report voice problems char- 
acterized by symptoms of hoarseness, difficulty with pitch, 
voice fatigue, weakness, and effortful production. Elementary 
and secondary schoolteachers were 32 times more likely to 
report these symptoms than a random sample of people in 
other occupations. Nearly indentical results were reported in 
Poland (Sliwinska-Kowalska, Niebudek-Bogusz, Fisher et al, 
2006). This high occurrence is unfortunate because it has also 
been estimated that 75% of teachers’ problems could be pre- 
vented or self-rehabilitated (Voice Academy, 2003). There 
are several obvious factors for why teachers are at risk for 
developing voice problems. They use their voices for pro- 
longed periods, often over background noise, with little 
recovery time. They are also exposed to upper respiratory 
tract infections by their close contact with children, have 
limited time-off for illnesses, and often work in unfavor- 
able environments having poor acoustics, chalk dust, low 
humidity levels, and noisy and inefficient heating systems. 
Mothers of small children are also prone to develop vocal 
nodules. Mothers and teachers have much in common. 
Although mothers may not have chalk dust to contend with, 
they often develop voice problems secondary to vocal stress 
from yelling at home and during sporting events of their 
children. 

Of 73 different occupations in the total sample, the most 
commonly occurring ones were retired persons, homemak- 
ers, factory workers, unemployed persons, executives or 
managers, teachers, students, secretaries, singers, and nurses. 
Factory workers were third on the list of the top five occupa- 
tions with a high incidence of laryngeal pathologies. These 
findings are consistent with a National Institute of Neurolog- 
ical Disorders and Stroke (2000) report that there is evidence 
of an increased incidence of vocal nodules in such occupa- 
tions as homemakers, teachers, singers, lawyers, salespeople, 
preachers, and telemarketers. It is not surprising to read that 
persons in “talking professions” or persons who work in fac- 
tories around noise commonly have voice problems. What is 
surprising is that retired and unemployed persons commonly 
have voice problems. It may be explained in part by Smith, 
Gray, Dove, et al. (1997): in a study of teachers, 40% expressed 
concern that voice problems might adversely affect future 
career options, who were almost twice as likely as nonteach- 
ers to state that difficulty with voice may be a problem in 
pursuing their profession of choice. Retired and unemployed 
persons with voice problems may represent individuals who 
quit “talking” careers because of voice difficulties. The retired 
persons may also be exhibiting problems related to the 
anatomic and physiologic changes that co-occur with age. In 


Koufman’s clinical population, 45% were elite vocal perform- 
ers and professional voice users such as clergy and teachers 
and 43% were non-vocal professionals whose job perform- 
ance was affected by voice. 

In Dorbe’s patients diagnosed as having psychogenic voice 
disorders, 85% were female. With respect to occupational 
background, 35% of psychogenic voice disorders were found 
in female homemakers. This gender-biased result may be 
skewed by the fact that, in general, females seek medical 
attention with greater frequency than do males. 

Several factors need to be considered in interpreting inci- 
dence studies. The time, place, population, definition of 
voice problems, and assessment tools all affect the outcome 
and explain the wide variance in reported results. Ten years 
ago, investigators were unaware of the importance of 
esophageal reflux on laryngeal tissue, subtle neurologic 
signs often went undetected, and videostroboscopy and 
other means of detecting minor structural deviations and 
lesions were often not included in test batteries. We are left 
to conclude that incidence studies are sorely needed, that 
the incidence and types of voice problems vary across life- 
span changes and occupations, and that persons working in 
voice-dependent occupations and those working around 
noisy or polluted environments are at greater risk. 


+ 


Summary 


+ The vital functions of the larynx are to protect the lungs 
from foreign substances, to stabilize the thoracic cage 
during work done with the upper extremities, and to 
compress the abdominal contents. 


¢ Phonation is a secondary laryngeal function and has 
primitive survival, emotive, entertainment (e.g., singing), 
and higher linguistic purposes. 


+ Normal voice is one in which voice quality, pitch, loud- 
ness, and flexibility are reasonably pleasing and audible 
to the listener. 


+ Abnormal voice is defined as deviations in quality, pitch, 
loudness, or flexibility that may signify illness and inter- 
fere with communication or a person’s occupational or 
social needs. 


Voice disorders are classified into organic types, which 
consist of dysphonias or aphonia caused by mass lesions, 
infections, or neurologic disease; psychogenic types, 
which include abnormal voice resulting from psychoneu- 
rosis, psychosis, or faulty habit patterns; or complex 
when the problem is multidimensional. 


The incidence of voice disorders is not well documented. 
It is estimated in school-age children to be 6 to 9% and 
possibly as high as 24%. These are mostly dysphonias 
caused by vocal abuse. The incidence of voice disorders in 
adults differs by age group, gender, occupation, and 
pathology, and thus contains nearly the entire spectrum 
of organic and psychogenic voice disorders. 
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Chapter 2 


Normal Voice Development 


But what am I? An infant crying in the night: An infant cry- 
ing for the light: And with no language but a cry. 


—Tennyson, Lord Alfred 


+ Infancy 


The Postnatal Larynx 


By the third month of fetal life, the larynx has the same features 
recognizable at birth but is a structure quite different from the 
adult larynx. At birth, the 2-cm-long by 2-cm-wide larynx is 
estimated to be one third the dimension of the adult larynx, is 
extremely soft and malleable, does not possess a vocal fold 
layer structure, and shows no sex differentiation (Bosma, 1985; 
Kent and Vorperian, 1995; Tucker and Tucker, 1979; Verhulst, 
1987). The laryngeal evolution from this embryonic state 
through senescence is central to voice, swallow, and respira- 
tion. Woisard, Serrano, and Pessey (1996) divide the laryngeal 
developmental modifications affecting the evolution of a child’s 
voice into topographic (laryngeal descent in the neck), morpho- 
logic (increase in volume modification of laryngeal shape), and 
histologic (lamina propria differentiation). The chronological 
voice evolution primed by the central nervous system inte- 
grates these modifications in development. 

Of these three modifications, topographic evolution 
appears to be the most controversial. It is commonly 
believed that phonation is phylogenetically a recent function 
of the larynx, made possible by the laryngeal descent, and is 
the reason humans can speak. Not all evidence supports this 
contention. During the neonatal cry, only gross vertical 
movements of the larynx and mandibular movements are 
possible, yet Menard, Schwartz, and Boe (2004) have shown 
that anatomy does not prevent even the youngest speaker 
from producing vowels perceived as the 10 French oral 
vowels though they recognize that the specific configuration 
of the vocal tract for the newborn seems to favor the produc- 


10 


tion of low and front vowels. Lieberman, Harris, Wolff, et al. 
(1971) have observed that these limited movements and the 
uniform cross-sectional configuration of the supralaryngeal 
tract bear a striking resemblance to those of primates. At 
birth, the neonate’s larynx occupies a position higher in the 
neck, relative to the palate and mandible, than at any other 
time in life. In fact, it is sufficiently high that the epiglottis is 
in contact with the velum allowing infants to carry out the 
functions of breathing and feeding simultaneously (Kent and 
Vorperian, 1995). Almost immediately after birth, it begins 
its descent in the neck. Lieberman, McCarthy, Hiiemae, et al. 
(2001) have suggested that spatial constraints related to 
deglutition impose greater restrictions on the rate and 
degree of hyo-laryngeal descent than do adaptations for 
vocalization, highlighting the different functional roles of 
the hyoid during voicing and oral transport. From a longitu- 
dinal series of lateral radiographs taken between the ages of 
1 month and 15 years, these investigators observed vocal 
tract shape changed markedly during ontogeny, especially in 
the first postnatal year and during the adolescent growth 
spurt. The ratio of pharynx height to oral cavity length 
decreased from 1.5 to 1.0 between birth and 6 to 8 years, after 
which it remained stable. The superoinferior spatial rela- 
tions between the positions of the vocal folds, the hyoid 
body, the mandible, and the hard palate did not change sig- 
nificantly throughout the entire postnatal growth period. 
Figure 2.1 shows that at birth, the lower border of the cricoid 
cartilage is level between cervical vertebrae 3 and 4(C3 and 
C4). By age 5, the larynx has descended almost to the level 
of C7. Between ages 15 and 20, it remains at C7. Although 
this developmental phenomenon has been reported to be 
unique to humans, Nishimura, Mikami, Suzuki, et al. (2005) 
found the laryngeal descent during infancy in chimpanzee 
infants was similar to that in human infants. Fitch and Reby 
(2001) also argue that laryngeal descent is not uniquely 
human. Using bioacoustic analyses of red and fallow deer 
coupled with anatomic analyses of functional morphology, 
they have documented these species lower the larynx in a 
manner similar to that exhibited by humans. They suggest 
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that laryngeal descent serves to elongate the vocal tract, 
allowing callers to exaggerate their perceived body size by 
decreasing vocal-tract resonant frequencies. This exagger- 
ated body size is consistent with work done by biologists 
Evans, Neave, and Wakelin (2006) who propose that a deep 
male voice may have once played a role in courtship and 
competitive behavior in humans by attracting female mates 
and indicating body size to male competitors. Thus, it seems 
clear that there is a developmental relationship between the 
descent of the larynx and the decrease in average voice pitch 
(as the pharyngeal tube elongates, it resonates lower funda- 
mental frequencies) and that the descent is not unique to 
humans. Less clear is the evolutionary cause of the descent, 
which is currently thought to have first evolved for swallow- 
ing and secondarily for voice. 

Morphologic changes begin at birth and continue through- 
out life. At birth, the thyroid cartilage is contiguous with the 
hyoid bone. The laryngeal skeleton then separates in a cranio- 
caudal direction. The infant’s epiglottis is bulky at this stage 
when seen superiorly. Together with the thick aryepiglottic 
folds, it is omega-shaped and lies over the dorsum of the 
tongue. During postnatal development, the epiglottis under- 
goes several changes, [enlarging epiglottis becomes firmer, 
and omega-shaped appearance is reduced, ] reaching maturity 
at puberty (Kent and Vorperian, 1995). The configuration of 
the alae of the thyroid cartilage changes from a rounded shield 
during fetal life to an angle of ~90 to 100 degrees in the male 
and 120 degrees in the female at birth. During puberty, the 
angle in the male thyroid alae narrows to 90 degrees, whereas 
in the female it remains nearly the same. Increase in length of 
the thyroid cartilage is nearly 3 times greater in males than in 
females (Kahane, 1982). During the first years of life, the open- 
ing into the larynx widens, changing from its T-shape to one 
that is more rounded or oval. 

The laryngeal airway is maintained by the circumference of 
the cricoid cartilage. However, the soft cartilages of the neona- 
tal larynx and lax supporting ligaments predispose the infant’s 





Fig.2.1 Vertical descent of the larynx during life. 
The illustration demonstrates the relationship of 
the lower border of the cricoid cartilage to the cer- 
vical vertebrae at various ages. (Adapted from 
Wind, J. [1970]. On the phylogeny and the ontogeny 
of the human larynx. Groningen: Wolters-Noordhoff 
Publishing, p. 105.) 
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larynx to collapse if negative air pressures become excessive 
within the internal vocal tract. Because the subepithelial tis- 
sues are less dense and more abundant and vascular, they have 
a tendency to accumulate tissue fluids, accounting for the high 
incidence in infants of infraglottic and supraglottic obstruction 
due to inflammatory edema. Ossification of the hyoid bone 
begins by age 2. Onset of ossification of the cartilage is contro- 
versial (Kent and Vorperian, 1995). Claassen and Kirsch (1994) 
suggest it begins soon after puberty when the larynx reaches 
its final size, which would be ~15 to 20 years of age, but 
Kahane (1983) and Sato, Kurita, Hirano, et al. (1990) suggest it 
begins around the third decade for males and fourth decade for 
females. It is generally agreed that the onset of ossification 
begins in the hyaline cartilage first with the thyroid followed 
by the cricoid. The hyaline and elastic arytenoid ossify later, 
but by the seventh or eighth decade, all the laryngeal 
cartilage, except the elastic epiglottis, cuneiforms, and cor- 
niculates, have ossified in males. The female larynx never com- 
pletely ossifies (Claassen and Kirsch, 1994). 

Vocal fold length, glottal width, and infraglottal sagittal and 
transverse dimensions at infancy, puberty, and adulthood are 
given in Table 2.1. The membranous and cartilaginous por- 
tions of the vocal folds are equal in length in infancy creating 
a proportionately larger glottis in the infant, but by adulthood, 
the membranous portion with greater pliability has elongated 
to approximately two thirds of the total glottal length. Glottal 
width and infraglottal dimensions also increase with age. 

Histologic modifications occur coincidental with the topo- 
graphic and morphologic changes contributing to the evolu- 
tion of voice. Most notable are changes that occur in the 
delicate histoarchitecture of the vocal folds, which consist of 
the thyroarytenoid and mucosa. At birth, the mucosa is 
thicker than that in adults and, in contrast with the layered 
lamina propria seen in adults, the infant mucosa is uniform 
and does not begin forming layers until somewhere between 
the ages of 1 to 4 years (Hirano and Sato, 1993; Hirano, 
Kurita, and Nakashima, 1983; Kahane, 1988). The adult 
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Table 2.1 Dimensions of the Larynx* 


Adult 

Infancy Puberty Male Female 

(mm) (mm) (mm) (mm) 
Vocal cord length 6-8 12-15 17-23 12.5-17 
Membranous portion 3-4 7-8 11.5-16 8-11.5 
Cartilaginous portion 3-4 5-7 5.5-7 4.5-5.5 
Glottis width at rest 3 5 8 6 
Maximum 6 12 19 13 
Infraglottis sagittal 5-7 15 25 18 
Transverse 5-7 15 24 17 


Source: Ballenger, J.J. (1969). Diseases of the nose, throat and ear. Philadelphia: 
Lea & Febiger, p. 275. 

*Measurements from other studies differ somewhat from those reported above 
but all concur that there is a continual increase in vocal fold length from birth to 
puberty with sex differences appearing between 6 and 10 years of age (Kent and 
Vorperian, 1995). 


differentiation between lamina propria layers is evident by 
puberty, but the histologic modifications continue through- 
out life with geriatrics exhibiting thicker edematous lamina 
propria with decreased density of elastin and collagen fibers. 

Histologic changes in the vocalis muscle are also abun- 
dant. Hirano, Kurita, and Nakashima (1983) noted that the 
muscle fibers were thin in newborns, fully developed by 27 
years and somewhat atrophied by the fifth decade of life. The 
newborn infant has more type II (long prolonged contrac- 
tion) than type I (short fast contraction) muscles. Type I 
muscles become more prominent with age contributing to 
increased control of vocal modifications and prolonged vocal 
expression (Woisard et al., 2006). Structural differences are 
not limited to developmental age related changes: Xue and 
Hao (2006), using acoustic pharyngometry to develop a 
database of the geometry of the vocal tract, found significant 
differences related to gender and race. 


Infant Cries 


The birth cry after the neonate has emerged from the womb 
signals the first conventional appearance of voice in humans. 
Nevertheless, human cries from the fetus in utero are known 
to have occurred. Thiery, Yo Le Sian, Vrijens, et al. (1973) doc- 
umented such a case in a 31-year-old woman in her fifth 
pregnancy. When the fetal head was 3 cm above the ischial 
spines, the membranes ruptured, releasing 2000 mL of clear 
fluid. According to Thiery, “While the fetal head was being dis- 
placed to allow the drainage of as much amniotic fluid as pos- 
sible, the fetus started crying. This sound was clearly heard by 
the mother, three physicians, and two midwives. The crying 
recurred six to seven times at intervals of up to 20 seconds.” 
Do normal infants have distinctively different cries, and, if 
so, are they indicative of definitive physiologic and emo- 
tional states? Is it possible to differentiate types of infant 
cries from acoustic spectrographic data and by listening to 
recordings? Are the cries of infants who are ill distinguish- 
able from normal infant cries, based on spectrographic data 
and listener judgment? In perhaps the earliest comprehen- 


sive study done of normal infant cries, Wasz-Höckert, Lind, 
Vuorenkoski, et al. (1968) obtained spectrographic and lis- 
tener judgment data based on 419 cries of 351 healthy 
infants. Listeners were able to identify four distinctive 
types of cries from birth through 7 months. 


1. Birth signal Duration +1 second; flat of falling melody; 
usually voiceless; always strained or strident; contains 
glottal plosives 


2. Pain signal Long duration; usually falling melody; high- 
pitched; strident 


3. Hunger signal Pitch rising-falling; frequent glottal plosives 


4. Pleasure signal Flat pitch; hypernasal; greater pitch vari- 
ability than the other types of cries; rare glottal plosives; 
never voiceless; never strident 


Because the cries are distinct LaGasse, Neal, and Lester 
(2005) have suggested that infant cry provides a noninvasive 
window into the neurologic and medical status of the infant 
because the acoustic characteristics of the cry from infants 
with neurologic damage, sudden infant death syndrome, pre- 
maturity, and various medical conditions deviate from typical 
cry characteristics. Robb (2003) similarly noted neuromatura- 
tional differences in cries of pre- and full-term infants. The 
crying episodes of full-term infants contained a greater num- 
ber of bifurcations and aperiodic segments. Wermke, Mende, 
Manfredi, et al. (2002) interpreted their longitudinal data on 
complexity of cry melodies in three pairs of healthy identical 
twins as support that acoustic analysis of cries reflects neuro- 
muscular maturation and is a good indication for evaluation of 
prespeech development. These authors recorded the twins at 
8th to 9th, 15th to 17th, and 23rd to 24th weeks of postnatal 
development. Over the period of recording time, cry melodies 
were increasingly complex. An increasing coupling and tuning 
between melody and resonance frequencies began during the 
second recording period at the 15th to 17th weeks. 

As early as infancy, cries illustrate a universal fact about 
the larynx: the true and false vocal folds respond differen- 
tially, depending upon psychophysiologic state. The pain cry 
is strained, tense, harsh, and forced owing to massive, tight, 
effortful closure of the entire laryngeal tract (i.e., it is an 
undifferentiated adduction of both the true and false vocal 
folds). The pleasure cry, more sonorous, lax, and devoid of 
strain of tension, is produced by the true vocal folds alone. 
Throughout life, the voice is smooth, clear, sonorous, devoid 
of strain or effort, and filled with rich inflectional patterns 
during pleasure and freedom from anxiety, fear, hostility, or 
depression. However, it is harsh, strained, and lacking in 
inflectional patterns during times of situational stress both 
in normal persons and in persons with some psychiatric ill- 
nesses. Clarici, Travan, Accardo, et al. (2002) attempted to 
determine if the crying of a newborn child was an alarm 
signal or protocommunication. Acoustic analyses compar- 
ing spontaneous cries in the cradle with pain conditions 
demonstrated that the newborn infants modulated the 
supralaryngeal tract more in cries after painful stimulus than 
in spontaneous cries. The authors proposed that these data 
supported the hypothesis that crying is protolanguage 
meant to communicate meaning. 
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In his review article on the signal functions of early infant 
crying, Soltis (2004) offers an interesting evolutionary 
perspective. He states that infant cries are in large part adap- 
tations that maintain proximity to and elicit care from care- 
givers. He continues that infant cries may function as a 
graded signal, as evidenced by the observation that during 
pain-induced autonomic nervous system arousal, neural 
input to the larynx increases tension and cry pitch. Increases 
in amounts of crying during the first few months of life he 
claims is a human universal, and potential signal functions of 
this excessive crying have been hypothesized to be manipu- 
lation of parents to acquire additional resources, honest sig- 
naling of need, and honest signaling of vigor. Of these 
hypotheses, he suggests evidence is most consistent with 
excessive crying as a signal of vigor that evolved to reduce 
the risk of a reduction or withdrawal of parental care. 


Fundamental Frequency 


Life begins with a voice fundamental frequency 300 to 400 Hz 
above that which is ultimately achieved by adulthood. In a 
1968 study of 419 cries from birth to 7 months by Wasz-Hock- 
ert et al., the measured frequencies varied according to the 
type of cry: 77 birth cries averaged 500 Hz, 60 pain cries 
between birth and 1 month averaged 530 Hz, 72 pleasure cries 
averaged 440 Hz. There was little difference for recordings on 
the same type of cry for infants between 1 and 7 months. 
Michelsson et al. (2002) analyzed a total of 1,836 cry signals 
from 172 healthy babies between 1 and 7 days old. Measuring 
8-15 cry samples from each infant, the authors reported that 
each infant’s cries varied between 450 and 520 Hz. In their 
sample of infants, 93% had cries with a mean Fo below 600 Hz 
and a range of over 200 Hz; mean value of the highest point of 
Fo was 583+ —151 Hz and the lowest point 398+—75 Hz. The 
Fo contour was generally a rising-falling contour, and there 
were no significant differences in cry according to gender or 
age. This large-scale study is consistent with earlier cross-sec- 
tional studies completed in the 1960s: Ringel and Kuppel 
(1964) found that the fundamental frequency of pain cries 
from 10 normal infants was 413 Hz; Ostwald, Phibbs, and Fox 
(1968), in a study of the pain cries of five infants, obtained ini- 
tial, highest, and final mean fundamental frequencies of 418 
Hz, 506 Hz, and 393 Hz, respectively, for each cry. 

In a longitudinal study of infant vocalizations recorded 
during the first 9 months of life Greene (1972) noted a pro- 
gressive increase in the range of inflectional glides during 
cooing and babbling. The earliest appearance of the upward 
glide, C3-C#, was in a 2-week-old girl, whose range 
increased from C3 to E3 by 7 weeks of age. The voice of an 
infant male at 5 weeks ranged from C# to F# in one breath, 
and at 16 weeks his musical inflections varied, rising and 
falling between C2# and E. Upward glides appeared first, 
then rising and falling glides increased in quantity and range 
up to an octave at 6 to 7 months of age. 

Acoustic features of comfort-state vocalization of infants 
3,5, and 9 months were studied by Kent and Murray (1982). 
Fundamental frequency measures were similar to those 
previously reported for 3-month-old infants (445 Hz), 
6-month-old infants (450 Hz), and 9 month-old infants (415 
Hz). Similar to cry data, the vocalizations were associated 


with tremor, harmonic doubling, abrupt Fo shift, vocal fry, 
and noise segments. 


Duration 


Wasz-Höckert, Lind, Vuorenkoski, et al. (1968) found that 
between birth and 1 month of age, pain cries (2.6 seconds) 
averaged approximately twice the average duration of hunger 
cries (1.3 seconds). The duration of cries between 1 and 7 
months of age were found to be nearly identical to those 
recorded for newborns. Similarly, Ringel and Kuppel (1964) 
and Ostwald, Phibbs, and Fox (1968) both found that pain 
cries averaged 1.5 seconds in length. Similarly Michelsson, 
Eklund, Leppanen, et al. (2002) reported cry duration of 1- to 
7-day-old infants was 1.4 + 0.6 seconds. Collectively, these 
figures indicate that on average, cries can be as short as 0.2 
second and as long as 2.7 seconds with pain cries being 
longer in duration than those recorded for hunger. Duration 
of comfort-state vocalizations for infants 3, 6, and 9 months 
old is generally less than 400 milliseconds but occasionally as 
long as 2 seconds (Kent and Murray, 1982). 


Listener Identification of Infant Cries 


Apparently, listeners are able to identify birth, pain, hunger, 
and pleasure cries beyond chance odds, and training 
improves this skill. Wasz-Höckert, Lind, Vuorenkoski, et al. 
(1968) had 349 women identify and categorize 24 infant 
birth, pain, hunger, and pleasure cries. The results showed 
that pleasure cries were identified most accurately, followed 
by hunger, pain, and birth cries. However, the accuracy of 
judgments depended upon the rater’s background. Ranked 
according to decreasing order of accuracy were (1) mid- 
wives; (2) children’s nurses; (3) mothers; (4) registered 
nurses; (5) other women experienced in childcare; (6) 
women with no experience in child care. Overall, there was a 
high degree of accuracy with all raters, even though rater 
groups differed significantly from one another. 

Hollien and Müller (1973) took issue with the conclusions 
of Wasz-Höckert and colleagues that certain cries are associ- 
ated with specific stimulus situations and that recognizable 
perceptual differences exist between cries. They argued that 
(1) birth cries and “pleasure” vocalizations ought not to have 
been juxtaposed in the design of the study; (2) preselection of 
“typical” samples of cries for listener judgments may have 
biased the listeners; (3) cry duration and infant age ought to 
have been controlled experimentally and statistically. In their 
own study, Hollien and Müller recorded cries in response to 
pain, auditory, and hunger stimuli from four male and four 
female infants, aged 3 to 5 months. Two groups of mothers lis- 
tened: one, the real mothers of the eight infants, and the other, 
mothers of infants of comparable age. Although statistical 
analysis proved that some of the hunger cries were correctly 
identified, a significant number of times, the listeners overi- 
dentified cries of all types as hunger cries, possibly revealing a 
bias toward judgments of this kind. From the study, the 
authors concluded that the acoustic characteristics of normal 
infant cries carry little perceptual information to the mother 
about the situation that evoked the cry response. However, 
the experimenters were less than completely confident in 
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their conclusions, calling them “tentative” and advocating the 
need for “additional investigations.” Moreover, as research has 
demonstrated significant durational differences between 
cries, Hollien and Miiller’s durational control may have elimi- 
nated an important acoustic cue in cry identification. An addi- 
tional interesting finding of this study was that the mothers of 
the infants in the study had little difficulty recognizing which 
samples were produced by their own infants. 

Thus, although caregiver perception of cry is far from clear, 
many authors suggest infant cries provide signal function that 
caregivers can reliably assess. This notion has been the impe- 
tus for the development of pain assessment tools for clinical 
assessment in the neonatal intensive care units (Bellieni, Sisto, 
Cordelli, et al. 2004; McNair, Ballantyne, Dionne, et al. 2004; 
Spence, Gillies, Harrison, et al. 2005). 


Facial Expression Associated with Infant Cries 


Infant cries and other vocalizations are inseparable from 
associated facial expressions and bodily movements. Young 
and Décarie (1977) coded the facial/vocal behavior and body 
movements of 75 male and female infants, aged 9 to 14 months, 
who were exposed to six different types of emotionally 
arousing situations. From their study, they were able to iden- 
tify two main classes of vocalizations: 


1. Positive vocalizations. Those associated with smiling facial 
expressions (e.g., babbling, cooing, laughing, and squealing) 


2. Negative vocalizations. Those associated with grimacing, 
trembling of the lips, and frowning (e.g., harsh wailing, 
wailing without harsh voice quality, and soft wailing) 


They then categorized these vocalizations into the 
following: 


+ Babbling Vocalization of low intensity, moderate pitch, 
and a soft-sounding sequence of consonant-vowel combi- 
nations, generally pleasurable to the listener, produced 
during prolonged exhalation and in quick sequence 


+ Coo Sustained, moderately high-pitched voice of mod- 
erate intensity, having a smooth onset, pleasant to the lis- 
tener, and produced during prolonged exhalation 


¢ Laugh Rapid repetitions of short exhalations of moder- 
ate pitch and harsh quality; pleasant to hear 


@ Squeal Extremely high-pitched, of moderate intensity, 
variable in duration, positive, and produced during 
extended exhalation 


+ Harsh wail Begins with breath-holding of sudden onset 
and has a rhythmic piercing sound, a definitely negative 
quality, and is harsh, irritating, and painful to hear 


+ Soft wail Begins with breath-holding, has a soft but neg- 
ative quality 


@ Wail Hard, high-pitched, discordant 


Physiologic Bases of the Infant Cry and Intonation 


Little is known of the aerodynamic, respiratory, and neuro- 
muscular bases of the infant cry. Fluctuations from periodic 
to aperiodic acoustic spectra in such cries have been noted 


by Lieberman, Harris, Wolff, et al. (1971), who attributed 
these variations to the infant’s inability to adjust the vocal 
folds to rising subglottic air pressures, the soft laryngeal tis- 
sues being unable to withstand such pressures. 

In normal children and adults, the fundamental frequency 
of the voice drops toward the end of a sustained utterance or 
breath group owing to the decline of infraglottal air pressure 
as exhalation progresses and there is less respiratory sup- 
port. Lieberman (1967) traces the origins of this universal 
human tendency to early infancy. Citing Bosma, Lind, and 
Truby (1964), he notes that the shape of the fundamental fre- 
quency contour of the cry is similar to that of the typical 
esophageal pressure contour. The amount of expiratory mus- 
cle activity is directly related to the infant’s excitability; as 
greater thoracic contraction produces greater subglottal air 
pressure, infant cries show higher fundamental frequency 
and intensity. The fundamental frequency falls as the breath 
group ends, the lungs being depleted of air at the end of the 
exhalation. There thus appears to be an innate physiologic 
basis for intonation. According to Lieberman (1967), “The 
infant’s hypothetical innate referential breath-group fur- 
nishes the basis for the universal acoustic properties of the 
normal breath-group that is used to segment speech into 
sentences in so many languages.” 

Infants respond to intonation of voice before they are able 
to comprehend language. This was first demonstrated by 
Lewis (1936), who showed that a 10-month-old male and a 
13-month-old female changed the fundamental frequencies 
of their voices depending upon whether they were playing 
with their mother (higher-pitched voice) or their father 
(lower-pitched voice). Thus, early in language development, 
before the actual appearance of distinctive linguistic fea- 
tures, intonation apparently takes on imitation and/or sym- 
bolic meaning. 

Lewis identified three intonational stages in the develop- 
ment of language: 


1. At an early stage, the child shows broad discrimination 
between different patterns of expression in intonation. 


2. When the total pattern (phonetic form plus intonational 
form) emerges because of language learning, intonation, 
not the phonetic pattern, dominates the child’s response 
at first. 


3. Finally, the phonetic pattern becomes the dominant fea- 
ture. Although the intonational pattern is subordinated, it 
never completely disappears. 


¢ Childhood 


Fundamental Frequency 


Fundamental frequency reflects vibratory patterns of the 
vocal fold and its elasticity, length, and vocal fold shape and 
thickness. Fundamental frequency of children has been the 
focus of hundreds of research studies initiated in the 1940s 
and continuing into the present time, where they have been 
the fodder for numerous theses and dissertations (e.g. Curry, 
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1940, 1949; Fairbanks, Herbert and Hammond, 1949; Fuchs 
et al. 2007). Generally, researchers agree that Fo decreases 
from the 500-Hz cry of an infant to around 300 Hz for the 
toddler. By prepubescence, Fo drops nearly 50% to around 
250 Hz (Busby and Plat, 1995; Curry, 1940; Hasek, Sadanand, 
and Murry, 1980; McGlone and McGlone, 1972; Smith and 
Gray, 2002; Stathopoulos, 2002) for both boys and girls. The 
most marked adolescent Fo drop occurs in boys and is 
thought to be related to the laryngeal growth changes result- 
ing from hormonal secretions of adrenal androgens (Smith 
and Gray, 2002). Comparison of the frequency/pitch in the 
male at puberty and at the termination of adolescence indi- 
cates that the male voice drops approximately one octave. 
Curry (1940) found the median fundamental frequency in 
10- and 14-year-old boys to be 269 Hz and 241 Hz, respec- 
tively, whereas in 18-year-old males the frequency 
descended to 137 Hz. Hollien and Malcik (1967) found the 
median fundamental frequency in 18-year-old males to be 
126 Hz. These figures are similar to those reported by White- 
side and Hodgson (2000), Whiteside (2001), and Whiteside, 
Hodgson, and Tapster (2002). Thus, for males we see a nearly 
50% drop from the infant cry (500 Hz) to childhood voice 
(250 Hz) and another 50% drop from childhood voice to 
adulthood (125 Hz). The change in Fo for females is less dra- 
matic but also reaches adult maturity during puberty. 
McGlone and Hollien (1963) found that female voices 
dropped 2.4 semitones between ages 7 and 8 to 11 and 15 
years. In a study of 15-, 16-, and 17-year-old females, Michel, 
Hollien, and Moore (1966) reported average fundamental 
frequencies of 207 Hz; their similarities indicating that by 
age 15, mutational change of voice is essentially completed 
in females. After pubertal changes, the voice remains fairly 
stable until around the sixth decade when postmenopausal 
voice may lower in some females and around the seventh 
decade when the male Fo may rise slightly. 

During the transition from the child to adult voice, audible 
voice breaks are uncommon, and they almost always occur 
in males, rarely in females. Pitch breaks can span an entire 
octave, transcending vocal registers from high-pitched 
falsetto to bass. Hoarseness from chronic laryngitis is com- 
mon, particularly if vocal abuse occurs during the period of 
voice change. If Curry’s (1949) figures are indicative, there is, 
apparently, a high prevalence of transient dysphonia during 
adolescent voice change, as he found that 80% of the voices 
of 14-year-old boys were of hoarse-husky quality. 

In brief, it seems clear that the fundamental frequency of the 
human voice descends with age. The most notable changes 
occur between birth and adolescence, paralleling the descent 
of the larynx in the neck and growth of the laryngeal structure. 
Despite hundreds of studies on fundamental frequency during 
childhood, data are incomplete; we do not yet have an unbro- 
ken continuity of data. Because each study varies in methodol- 
ogy—some employing oral reading and others contextual 
speech—interpretation of results must be guarded. 


Frequency Range 


Phonational frequency range is the difference in frequency 
between the lowest sustainable tone in the modal register 
and the highest falsetto. Studies of children’s frequency 
(pitch is its physical correlate) ranges are incomplete, com- 


plicated by mixed purposes and methodologies, some being 
designed for gathering musical information and others for 
physiologic data. Nevertheless, a picture of frequency range 
begins to emerge. Van Oordt and Drost (1963) studied 126 
children, dividing them into two groups: birth through 5 
years of age (45 children) and 6 through 16 years of age (81 
children). The children aged 6 through 16 were asked to sing 
from an arbitrary tone up to the highest they could reach and 
then down to the lowest one. From this method, the 
researchers derived (1) highest musical tone (the highest 
tone having musical quality) and (2) highest physiologic tone 
(the highest tone attainable, regardless of musical quality or 
clarity). The lowest musical tone and the lowest physiologic 
tone were based on similar judgments. Figure 2.2 shows the 
comparison between the physiologic voice (frequency) 
range and the musical voice (frequency) range. This figure 
illustrates that the physiologic voice range remains constant 
from ages 6 to 16 (~21/2 octaves) whereas the musical voice 
range expands. 

Developmental changes in voice are not limited to changes 
in frequency. Aerodynamic and acoustic results, reported by 
Stathopoulos and Sapienza (1997) in a cross-sectional study 
that included children aged 4 to 14 years and adults, show 
that men and 14-year-old boys function differently than 
women and all other groups of children. Children’s respira- 
tory values are similar to those of adults by the time they are 
12 to 14 years of age. 


+ Puberty and Adolescence 


Anatomy and Physiology of Adolescent Voice Change 


Until puberty, the larynx is of equal size in the male and the 
female as discussed elsewhere in this text. As shown in Table 
2.1, at puberty the male laryngeal growth outdistances the 
female particularly in the anteroposterior dimensions. 


+ By adulthood, the membranous portions of the male vocal 
folds range from 11.5 to 16 mm (in length, a 4- to 8-mm 
increase from puberty). However, on average, the female 
membranous vocal folds increase in length to only 8 to 
11.5 mm (a 1- to 3.5-mm increase). 


+ The dimensions of the infraglottal sagittal and transverse 
planes grow in the male to 25 mm, respectively, and in 
the female to 18 mm and 17 mm, respectively. 


+ Although there is considerable gender overlap, the angle 
of the thyroid lamina decreases until it becomes ~90 
degrees, whereas in the female larynx, the angle remains 
~120 degrees. 


+ The laryngeal mucosa loses some of its transparency and 
becomes stronger; the epiglottis flattens, increases in 
size, and elevates; and the tonsils and adenoids partially 
atrophy. 


+ The larynx descends in the neck. The neck itself elongates. 
Because of the greater enlargement of the thorax in most 
males, there also is a more prominent increase in vital 
capacity than seen in females. 
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Age of Onset and Duration of Adolescent 
Voice Change 


Evidence indicates that onset of puberty occurs earlier in 
warmer climates and with children who are heavier in 
weight. In temperate climates, onset of puberty in females 
ranges from ages 12 to 14 and in males from ages 13 to 15. 
Near the Equator, onset is accelerated 1 to 2 years, and near 
the poles it is delayed 1 year. Onset of puberty also may be 
delayed in females by excessive exercise or nutritional defi- 
ciencies such as occur with anorexia and bulimia. 

The average time from onset to completion of adolescent 
voice change is 3 to 6 months, 1 year at most. Fuchs, 
Froehlich, Hentschel, et al. (2007) claim that acoustic fea- 
tures of voice change begin appearing 5 to 7 months before 
onset. They divide pubertal voice change into three time 
periods: premutation, mutation when the voice is most 
unstable, and postmutation. In females, the voice change is 
complete by age 15 and in males by age 14 or 15. Although 
the onset of voice change occurs earlier in females, males 
and females complete the change at approximately the same 
age. 


Adolescent Voice Frequency Change 


Frequency/pitch distinction between male and female 
begins during puberty and continues throughout adoles- 
cence. These voice changes—the result of growth of the 
phonatory, resonatory, and respiratory anatomy—roughly 
parallel the appearance and development of the secondary 
sex characteristics. The pitch and quality changes that occur 
at puberty are more apparent in males than in females 
because of the greater magnitude of the pitch change, which 
is approximately one octave in males. The voice of males at 
puberty often begins with a husky quality and an unsteady 





Fig. 2.2 Comparison between physi- 
ologic and musical voice frequency 
range. Physiologic frequency range 
(PVR) is represented by total area and 
musical frequency range (MVR) by 
darker area. Both are bordered by 
regression lines of highest and lowest 
registered physiologic and musical 
tones plotted logarithmically. (From 
van Oordt, H.W.A., Drost, H.A. [1963]. 
Development of the frequency range 
of the voice in children. Folia Phoniat 
15, 289-298.) 


pitch, oscillating perhaps one to two tones. Although the 
pitch fluctuates from day to day, the general trend is down- 
ward. With time, the high tones become less steady, the low 
tones more stable. Fuchs, Froehlich, Hentschel, et al. (2007) 
reported on a 3-year longitudinal study of 21 male members 
of a professional boys’ choir from which they concluded that 
acoustic analyses could be used to predict mutational 
change in the speaking voice because nonaudible significant 
changes of acoustic features appeared 5 to 7 months before 
mutation onset. 


Stormy Voice Mutation 


Stormy voice mutation refers to pervasive sudden voice 
breaks from high to low pitch, or the reverse, or excessively 
husky or hoarse voice associated with adolescent voice 
change. Although there may be some hoarseness or huski- 
ness to the voice quality and increased perturbations, it 
should be emphasized that the majority of male adolescents 
have uneventful voice change. 


¢ Adulthood 


Fundamental Frequency 


Research on the fundamental frequency of the speaking 
voice for the age continuum in adults is also far from com- 
plete because the instrumentation, procedures, time of day, 
and subject inclusion/exclusion criteria used to obtain fun- 
damental frequency vary among studies. Despite these 
limitations, the fundamental frequency values obtained by 
independent researchers are similar within gender groups. 
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Table 2.2 Male Mean Fundamental Frequency (Hz) 
as a Function of Age Comparative Studies 


Mean 
Age Fundamental 
Range Frequency 


(Years) No. (Hz) Investigators 














20-29 175 120 Hollien and Shipp (1972) 
27 119 Hanley (1951) 
157 128 Hollien and Jackson (1973) 
24 132 Philhour (1948) 
6 132 Pronovost (1942) 
103 138 Majewski, Hollien, Zalewski 
(1972) 
30-39 175 112 Hollien and Shipp (1972) 
40-49 175 107 Hollien and Shipp (1972) 
39 113 Mysak (1959) 
50-59 175 118 Hollien and Shipp (1972) 
60-69 175 112 Hollien and Shipp (1972) 
70-79 175 132 Hollien and Shipp (1972) 
39 124 Mysak (1959) 
80-89 175 146 Hollien and Shipp (1972) 
39 141 Mysak (1959) 


We have arbitrarily taken 69 years as a cutoff point in our 
discussion of adult fundamental frequency because we wish 
to discuss a succeeding category that might be designated 
“older age.” 

A summary of several major fundamental frequency stud- 
ies, conducted in the middle of the 20th century, of the male 
speaking voice in adult males is found in Table 2.2. From this 
table, it can be seen that most early studies were of males 
within the 20- to 29-year age group. The group mean funda- 
mental frequency for all studies within this age group is 128 
Hz, and the range is 119 to 138 Hz. Similar values (Fo = 119 
Hz) were reported by Naufel de Felippe, Grillo, and Grechi 
(2006) for males between 20 to 45 years of age without signs 
or symptoms of vocal problems. Studies by Hollien and Shipp 
(1972) and Wang, Gao, Yang, et al. (2006) show that in the 
seventh decade, the fundamental frequency of males begins 
to rise. Critics suggest studies using college student volun- 
teers should be regarded with caution because they repre- 
sented a narrow segment of the population, and this data 
might be different from average individuals. Hollien, Hollien, 
and de Jong (1997) put this argument to rest. They reported 
that university students were found to be slightly larger than 
a cohort approaching the average population, but only minor 
insignificant vocal differences were found. 

Data on the fundamental frequency of the speaking voice 
in adult females indicate that as a group, their voices average 
1.6 times higher than those of males. Kelley (1977) studied 70 
females, aged 20 to 90 years, whose mean fundamental fre- 
quencies and ranges are given in Table 2.3. An important 
finding of this study was a strong correlation between age 
and Fo with fundamental frequency in females decreasing 
throughout the age range 20 to 70 years. These data are at 
variance with the results of studies by McGlone and Hollien 
(1963), who found no significant change in mean fundamen- 


Table 2.3 Female Mean Fundamental Frequency (Hz) 
as a Function of Age 


Age Range Mean Fundamental 
(Years) No. Frequency (Hz) 
20-29 10 227 

30-39 10 214 

40-49 10 214 

50-59 10 214 

60-69 10 209 

70-79 10 206 

80-90 10 197 


Source: Kelley, A. (1977). Fundamental frequency measurement of female voices 
from twenty to ninety years of age. (Unpublished manuscript.) Greensboro, NC: 
University of North Carolina. 


tal frequency in females from early adulthood through 
advanced age. In addition, Saxman and Burk (1967) found 
that in females, mean fundamental frequency decreases dur- 
ing the middle years and then increases in old age. The reason 
for these differences has not been adequately documented 
but is hypothesized to relate to changes in the anatomic 
structure of the vocal apparatus. Additionally, oral reading 
results in higher mean speaking fundamental frequencies 
than those for spontaneous speech (Hollien, Hollien, and 
de Jong, 1997) suggesting different tasks may have con- 
tributed to differences. Also, the variability between subjects 
is sufficiently high that it may be necessary to record larger 
sample sizes with enough power to take into account race, 
hormones, culture, age, and general health. 


+ Older Age 


Having documented the decline in the fundamental frequency 
of the human voice from infancy through adulthood, we now 
ask, “What happens to the voice of the aged?” Does it attest to 
signs of aging organism? Can we hear differences between 
younger and older voices? Are there physical changes in the 
structure of the larynx or its innervation that might account 
for such voice changes? Information accumulated to date 
indicates an affirmative answer to all of these questions. 


Physical Changes 


Several studies show structural changes of both the respira- 
tory system and the vocal folds with age. Atrophy of the 
intrinsic laryngeal muscles, thinning and dehydration of the 
laryngeal mucosa, loss of elasticity of ligaments, qualitative 
and quantitative changes in collagenous fibers resulting in 
changes in the viscoelasticity of the vocal folds, calcification 
of cartilages, flaccidity and bowing of the vocal folds, and 
edema have been documented by Ferreri (1959), Jackson and 
Jackson (1959), Keleman and Pressman (1955), Myerson 
(1976) and Sato, Hirano, and Nakashima (2002). 

In a study of men and women aged 69 to 85 years, 
Honjo and Isshiki (1980) saw a yellowish or dark-grayish 
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discoloration of the vocal folds in 39% of the men and 47% of 
the women. Edema of the vocal folds was noted in 74% of 
the women and 56% of men. Vocal fold atrophy was found 
in 67% of the men and 26% of the women, inferred from 
either bowing of the edges of the vocal folds, visibility of 
the ventricle, or prominence of the contour of the vocal 
processes of the arytenoid cartilages. Glottal gap was 
observed in 67% of the men as semicircular (22%), linear 
(28%), or partial (17%). However, this finding was less com- 
mon in women, 58%, among whom the semicircular gap was 
particularly rare. Vocal sulcus was noted in ~10% of both 
men and women. 

These authors concluded that the discoloration of the 
vocal folds found in approximately one half of the subjects 
was due to either fat degeneration or keratosis of the mucous 
membrane that lines the larynx. Vocal fold atrophy and glot- 
tal gap in men were attributed to senescent changes of the 
entire thyroarytenoid muscle and mucous membrane, and 
edema of the vocal folds in women was attributed to general 
endocrine changes after menopause. This increase in the 
mass of the vocal folds may be the cause for noticeable low- 
ering of average fundamental frequency and roughness and 
hoarseness of voice in aged women. 

Morrison and Gore-Hickman (1986) found that it is the 
older men with atrophy and the older women with poly- 
poidal degeneration who usually complain about their 
abnormal voices. These authors found that older patients 
have a tendency to misuse their voices, their phonation com- 
monly associated with hyperactivity of the ventricular folds. 
They speculate that this vocal misuse or abuse may have 
psychogenic bases or may be due to an attempt to compen- 
sate for atrophy or polypoidal changes. Thus, they raise the 
strong possibility that psychologic distress may make an 
important contribution to dysphonia in older persons, owing 
to tension, hypochondriasis, multiple medications and 
health problems, reduced finances, and depression. This 
neglected population, although they may not complain of 
depression or anger, may nevertheless be reacting to loneli- 
ness, isolation, family separation, and conflicts with spouses 
who are also aging. 

In a study of the stroboscopic movements of the vocal 
folds in 20 young adult women aged 22 to 28 years and 20 
older women aged 60 to 77 years, Biever and Bless (1989) 
found aperiodic vibration in 85% of the older women but in 
only 30% of the younger ones and a posterior glottal gap 
common to both age groups. Electroglottography (EGG) 
recorded on geriatrics (Higgins and Saxman (1991); Winkler 
and Sendlmeier, 2006) showed a decreased EGG open quo- 
tient (OQ) with increased age that was positively related to 
perceived age. 


+ Perceptual Studies of Voice 
Sound Changes 


Listeners can accurately detect differences in chronological 
age by the sound of the voice, based on ratings of contextual 


speech and vowel prolongation (Ptacek and Sander, 1966; 
Ryan and Burk, 1974; Ryan and Capadano, 1978; Shipp and 
Hollien, 1969). Studies have also shown that the acoustic 
physical properties of voice change with advancing age 
(Ferrand, 2002; Kent and Burkhard, 1981; Mueller, 1997; 
Verdonck-de Leeuw and Mahieu, 2004; Watson, 1998). 
Sustained phonation in older age groups has been described 
as hoarse, breathy, and tremorous by Ptacek and Sander 
(1966), Hartman and Danhauer (1976), and Ryan and 
Capadano (1978), characterized by overall deterioration of 
the acoustic signal (Verdonck-de Leeuw and Mahieu, 2004), 
lowering of harmonics-to-noise ratio (Ferrand, 2002), 
decreases in Fo during devoicing (Watson, 1998), and 
increases in variability of voice onset time (VOT) (Sweeting 
and Baken, 1982). 

The notion that voice deterioration is a given with geri- 
atrics was challenged by Ramig and Ringel (1983), who 
found that whether or not the voice of an aging person was 
judged abnormal depended on that person’s general physical 
health. Older subjects in poor physiologic condition had sta- 
tistically significantly more jitter, shimmer, and spectral 
noise in their voices than did those in good physical condi- 
tion. Peppard and Bless (1987) compared the voices of geri- 
atric singers with nonsinger geriatric and young adults. They 
found that the voices of the geriatric singers did not deterio- 
rate and were more like those of young adults than the 
voices of their aged-matched peers suggesting vocal exer- 
cises may prevent, or at least deter, age-related changes in 
voice production. 

In a subsequent study, Ramig, Gray, Baker, et al. (2001) 
compared speakers’ physiologic condition with listeners’ 
ratings of age according to resting heart rate, systolic blood 
pressure, diastolic blood pressure, forced vital capacity, and 
adjusted forced vital capacity. They found no statistically sig- 
nificant relationship between ratings of age and actual 
chronologic age in patients 25 to 75 years who were in good 
physiologic condition but did find a statistically significant 
relationship between age and judgment of voices as poor in 
patients who were in poor condition. Listeners were more 
accurate in their ratings of age when judging connected 
speech in comparison with vowel prolongation. They were, 
however, more accurate in identifying aged speakers on the 
basis of vowel prolongation who were in poor physiologic 
condition in comparison with older speakers who were in 
good condition. Similarly, Orlikoff (1990) in his study of the 
relationship of age and cardiovascular health to certain 
acoustic characteristics of male voices noted that the phona- 
tions of atherosclerotic individuals were generally associ- 
ated with greater short- and long-term variability compared 
with healthy elderly and that perturbation values were 
higher in both geriatric groups compared with their younger 
cohorts. These data seem to support the importance of good 
general health in the preservation of youthful voice in older 
age groups. 

From these data, it can then be deduced that an older 
sounding voice is not inevitable as we age; how we sound 
may be closely tied to the general health and physiologic 
reserve of the person as well as vocal exercise. What also 
needs to be studied in greater depth, however, is the effect of 
psychologic changes on voice in the elderly. Physical and 
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social isolation restricting the amount of daily voice use in 
the elderly may have deleterious effects on voice. 


Fundamental Frequency 


It was already noted that Hollien and Shipp (1972) demon- 
strated a progressive lowering of fundamental frequency 
in males from early adulthood to the 40- to 50-year age 
range, but above this range there was a reversal of this 
downward trend. The saucer-shaped scatterplot of 175 
male subjects can be seen in Fig. 2.3. The same trend has 
been demonstrated in a study by Mysak (1959) of 12 males 
aged 65 to 79 years and 12 males aged 80 to 92 years. He 
obtained mean impromptu speaking voice fundamental 
frequencies of 120 Hz for the 65- to 79-year age group and 
137 Hz for the 80- to 92-year age group. The males aged 80 
to 92 years had statistically significantly higher fundamen- 
tal frequencies than those in the 65- to 79-year age group. 
Similarly, in a study by Honjo and Isshiki (1980), the aver- 
age fundamental frequency of 20 males aged 69 to 85 years 
was 162 Hz, considerably higher than that for young males 
(120 to 130 Hz). 

Fundamental frequency of the speaking voice as a function 
of age in females does not follow the same trend as in males. 
In a group of females aged 65 to 79 years studied by McGlone 
and Hollien (1963), the average frequency was 196 Hz, and in 
another group aged 80 to 94 years, 199 Hz was the average. In 
the study by Kelley (1977), not only did fundamental fre- 
quency fail to rise in older-aged females, it declined. In the 
study by Honjo and Isshiki (1980) of 20 females aged 69 to 85 
years, fundamental frequency also showed a failure to 
increase; it actually decreased from ~260 Hz in young 
females to an average of 170 Hz in the older age group. The 
increase in fundamental frequency in males and its failure to 
increase in females is hypothesized to be related to gender dif- 
ferences in the aging structure. 





Fig. 2.3 Male fundamental frequency as a function 
of age. Scatterplot of the mean fundamental fre- 
quencies of the speaking voices of 175 subjects. Solid 
line connects mean values for each age decade. 
(From Hollien, H., Shipp, T. [1972]. Speaking funda- 
mental frequency and chronologic age in males. 
J Speech Hear Res 15, 155-159.) 
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Frequency Range 


A study by Hollien, Dew, and Philips (1971) of 534 subjects 
(332 males and 202 females) is still one of the most extensive 
works available on phonational frequency range in adults. 
The males ranged in age from 16 to 59 years and the females 
from 18 to 75 years. Subjects had to match voice pitches to 
series of pure tones downward and upward from a predeter- 
mined frequency. Figure 2.4 shows the frequency ranges for 
each subject. Individual ranges extended from approxi- 
mately one octave (13 semitones) to more than 4.5 octaves 
(55 semitones). Females produced a greater range than 
males, extending from nearly two frequency ranges for 
males and females were 38 semitones, and both groups 
exhibited large standard deviations. From these cross-sec- 
tional data, there appeared to be no age-related trend 
though others have reported it expands during puberty and 
reduces with old age. These ranges are slightly greater when 
compared with the studies of fewer subjects done by Ptacek, 
Sander, Maloney, and Jackson (1966), Hollien and Michel 
(1968), and Colton (1959). The negative consequence of 
vocal aging on quality of life is significant according to 
results of a longitudinal study completed by Verdonck-de 
Leeuw and Mahieu (2004) and underscores the importance 
of voice studies. 


Developmental Changes Reflected in Other 
Voice Measures 


Measures of fundamental frequency are not the only measures 
thought to reflect changes related to gender, age, race, native 
language, general health, and task. Because individual speak- 
ers have unique ways of achieving the same phonatory targets, 
neither Fo nor any other single analysis procedure is sufficient 
to identify age. Vocal age is effected by vocal use, biological 
condition, and disease. If we are to identify measures that have 
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(A) Phonational frequency ranges of 202 adult females. (B) Phonational frequency ranges of 332 adult males. (From 


Hollien, H., Dew, D., Philips, P. [1971]. Phonational frequency ranges of adults. J Speech Hear Res 14, 755-760.) 


both specificity and sensitive to life-span changes, it is neces- 
sary to take these factors into consideration. 

Knowledge of the lack of sensitivity and specificity of single 
vocal measures to age has led to a plethora of multidimensional 
analyses. As previously stated, acoustic analyses including per- 
turbation values, phonation gaps, intensity, variability of fre- 
quency, harmonics-to-noise ratio, and dynamic range are not 
frequently used in the clinic. Another example, Hwa-Chen 
(2007, 2008) determined that speakers of tonal language had a 
greater frequency/intensity profile than did nontonal language 
speakers though their mean fundamental frequencies were sim- 
ilar. Gorham-Rowan and Laures-Gore (2006) reported signifi- 
cant age-related differences in Fo standard deviation, amplitude 
perturbation quotient, and noise-to-harmonic ratio. Videostro- 
boscopic images of geriatric speakers also differed from their 
younger counterparts (Biever and Bless, 1989; Linville (1992, 
1995); Bloch and Behrman, 2001) but when singing training was 
taken into account, they were similar (Peppard and Bless, 1991; 
Peppard, 1991; Peppard, Bless, and Milenkovic, 1988). 


+ 


Summary 


Birth, pain, hunger, and pleasure cries can be differenti- 
ated from one another. 


From birth, the larynx enlarges and descends in the neck, 
and with these growth changes the fundamental fre- 
quency (pitch) of the voice decreases. 


Male-female fundamental frequency differences are 
insignificant until puberty, at which time the male voice 
descends a full octave, whereas the female voice drops 
three to five semitones. 


Acoustic signs of puberty occur 3 to 4 months before its 
onset. 


Fundamental frequency may not be as specific or sensi- 
tive to age-related changes in the voice as other acoustic 
measures. 
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+ In male and female adults, the fundamental frequency of 
the voice descends throughout life. However, in older age, 
the fundamental frequency of the male voice begins to 
ascend, whereas in the female voice it does not. 


+ The vocal folds undergo structural changes during older 
age, consisting of atrophy, thinning, edema, both quanti- 
tative and qualitative changes in the extracellular matri- 
ces of the vocal folds, and dehydration of the mucosa cov- 
ering the vocal folds. 
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Chapter 3 


Voice Disorders of Structural Origin 


+ Voice Disorders Due to Structural 
Changes in Childhood 


Structural changes of the vocal folds at any age produce one 
or more of the following pathologic changes: 


increase or decrease the mass or bulk of the vocal folds or 
immediately surrounding tissues 


alter their shape 
restrict their mobility 
change their tension 


modify the size or shape, or both, of the glottic, supraglot- 
tic, or infraglottic airway 


prevent the vocal folds from approximating partially or 
completely along the vibratory edge 


+ result in excessive tightness or irregularity of approxima- 
tion 


+ result in irregular chaotic vibration 


Laryngeal problems of a structural nature may be either 
congenital or acquired. In either case, they result in a disor- 
der of pitch, loudness, or quality. One of the most unusual 
and rare structural voice disorders that affects pitch, loud- 
ness, and quality is that of cri du chat, or cat cry. 


Cri du Chat 


Infants who are ill or who have structural differences in the 
larynx cry differently from normal infants, and although cer- 
tain types of cries are associated with specific illnesses in the 
newborn, nonspecific distress cries are often produced at 
higher than normal fundamental frequency (Mallard and 
Daniloff, 1973). 


24 


The syndrome of cri du chat in neonates and children was so 
named because the voice has a distinctive, high-pitched, plain- 
tive wail resembling the cry of a cat, which is immediately rec- 
ognizable as different from other infants. For example, the 
attendants in a newborn nursery had nicknamed a child “kit- 
ten” because of her weak, high-pitched mewing cry (Ward, 
Engel, and Nance, 1968). Similarly, a telephone repairman, not 
seeing the crying child, remarked to the foster mother that she 
had “an awfully angry cat” (Dumars, Gaskill, and Kitzmiller, 
1964). This voice signature of the syndrome is a prime example 
of the diagnostic usefulness of voice. Three cases of cri du chat 
were first reported by Lejeune et al. (1963), a professor of 
genetics at the University of Paris. More recently, Mainardi, 
Pastore, Castronovo, et al. (2006) reported on the genetic char- 
acteristics of 220 patients diagnosed with cri-du-chat syn- 
drome, which is a contiguous gene syndrome that results from 
a deletion of the short arm of chromosome 5 (5p) as illustrated 
in Fig. 3.1. Molecular-cytogenetic investigation (flourescent 
in-situ hybridization, FISH) showed a variety of aberrations 
including interstitial deletions, short terminal deletions, and 
other rare rearrangements of 5p. Cytogenetic and clinical vari- 
ability in this population are the rule rather than the exception. 
The cat-like cry and peculiar timbre of voice were the most 
typical signs of the syndrome, not only at birth but also later, 
and these were the only signs that suggested the diagnosis in 
patients with small deletions and a mild clinical picture. The 
unusual voice, however, was only one of many genetic defects 
associated with this rare syndrome. In addition to abnormal 
laryngeal development, sucking and feeding difficulties, and 
respiratory infections, the syndrome presents with microg- 
nathia, a bird- or beak-like profile, microcephaly, hypotonia, 
hypertelorism, downward-slanting palpebral fissures, devel- 
opmental disabilities, low-set ears, strabismus, midline oral 
clefts, and failure to thrive. Cardiac, cerebral, renal, and gas- 
trointestinal malformations were more frequent in the 
patients with unbalanced translocations resulting in 5p dele- 
tions. Mainardi and colleagues also reported intubation diffi- 
culties linked to larynx anomalies and delayed psychomotor 


3 Voice Disorders of Structural Origin 25 





ve 
ye di tte 
MeTETITELE 
baa 


po =e 


aha 


% 
GRADA Qa 
xY 


EA% 


Figure 3.1 Defective B-group chromosome in patient with cri-du-chat 
syndrome. Arrows show the partial short-arm deletion of a B-group 
chromosome. (From Ward, P.H., Enel, E., Nance, W.E. [1968]. The larynx 
in the cri du chat [cat cry] syndrome. Am Acad Ophthalmol 72, 90-102.) 


development related in severity to deletion size and type as 
well as other genetic and environmental factors. The distinc- 
tive voice characteristics resulting from the structurally small 
larynx and resonance chambers and the neurologic differences 
follow: 


+ overall fundamental frequency invariably higher (X = 860 
Hz) than for pain (X = 530 Hz) and hunger (X = 500 Hz) 
cries of normal infants 


+ cry durations characteristically longer than normal (X = 
2.6 seconds) and similar to normal pain cries 


+ flat or rising melody patterns 
è strained quality rarely containing glottal plosives 


crying on inhalation with inhalatory stridor (rare in nor- 
mal infants). Noteworthy is the observation that listeners 
had difficulty in determining whether the cry at a partic- 
ular moment was occurring on inhalation or on exhala- 
tion, with the inhalatory-exhalatory cry signals failing to 
demonstrate the normal “on-off” phenomenon. 


Congenital Laryngeal Stridor 


Stridor means involuntary sound made during inhalation or 
exhalation. Stridor can occur from any laryngeal disease (not 
just laryngomalacia) that produces partial obstruction of the 


airway either subglottally, glottally, or supraglottally. 
Although obstruction can occur at any age, detection of stri- 
dor is especially critical in the newborn because it signals 
impending asphyxiation. Abnormal anatomic development, 
laryngeal masses, inflammation, and paralysis are its most 
common causes. Different-sounding types of stridor Rohde, 
M. and Banner, J. (2006). Respiratory tract malacia: possible 
cause of sudden death in infancy and early childhood. Acta 
Paediatrica 95: 867-870. 


Laryngomalacia 


In laryngomalacia, the laryngeal cartilages are soft and under- 
developed. Morphologically, cartilage rings are shorter and 
softer than normal, causing collapse, especially during forced 
expirations. The etiology of laryngomalacia is uncertain. The 
voices on exhalation, during crying, and during other types of 
phonation may sound normal, but on forced exhalation and 
inhalation there is a low-pitched vibratory fluttering or a high- 
pitched crowing, more often intermittent than constant. Other 
symptoms may include wheezing, barking cough, frequent res- 
piratory infections, and cyanotic episodes (Rohde and Banner, 
2006). The stridor is more common when the infant is supine 
than prone. It often goes unnoticed at birth but becomes appar- 
ent a few weeks after discharge from the hospital. 

Following is the physical appearance of the larynx in 
laryngomalacia on examination: 


+ The epiglottis is low and narrow and has curled edges 
(omega-shaped). 


+ The aryepiglottic folds are approximated, obscuring the 
glottis; sucked into the glottis on inhalation; and blown 
out on exhalation. 


+ The mucosa over the arytenoid cartilages appears redun- 
dant. 


All these features, it should be pointed out, can be found in 
the normal infant larynx, the difference between the normal 
larynx and laryngomalacia being one of degree. Prognosis for 
the disappearance of laryngomalacia within a few months to 
1 year is good. Nevertheless, Rohde and Banner have cau- 
tioned, “Laryngo-tracheo-bronchomalacia (LTBM)...may be 
an unrecognized cause of sudden death in infancy and early 
childhood, and should be considered, especially in cases 
where there is a history of respiratory distress.” 


Congenital Subglottic Stenosis 


Arrested embryonic development of the conus elasticus or 
maldevelopment of the cricoid cartilage will produce an 
obstructive narrowing of the airway from the level of the 
vocal folds down to the cricoid area, the point of maximum 
obstruction being 2 to 3 mm below the glottis. The voice will 
be stridorous from birth, but sometimes this will be noted 
only during respiratory infections. The voice during crying is 
usually normal (Ferguson, 1970). 

Although the cause of the arrested development is 
unknown, Tateya, Tateya, Surles et al. (2006) have presented 
some intriguing data on malformed tracheal and laryngeal 
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cartilages in vitamin A-deficient rats. In rats, they have 
shown that pups of a vitamin A-deficient mother have 
maldevelopment of the cricoid cartilage among other laryn- 
geal deformities such as laryngotracheal clefts. 


Laryngotracheal Cleft 


Embryonic failure of fusion of the dorsal cricoid lamina 
leaves an interarytenoid cleft and an open larynx posteriorly. 
Few cases have been described, and correct identification of 
the defect is reportedly difficult. The cry is weak, feeble, or 
aphonic. Feeding difficulties and survival initially over- 
shadow the voice disorder. 


Congenital Laryngeal Web 


A web of tissue covering part or all of the glottis may be 
present at birth because of its failure to separate during the 
10th week of embryonic laryngeal development. Infants may 
present with various types of laryngeal webs congenitally, 
secondary to infection, or traumatically from intubation (Liu, 
Luo, Zhong, et al. 2006). The distinction is important because 
of the association of congenital webs with velocardiofacial 
syndrome (chromosome 22q11.2 deletion). Miyamoto, Cot- 
ton, Rope, et al. (2004) reported that 11 of their 17 patients 
with anterior glottic webs, or 65%, were positive for chromo- 
some 22q11.2 deletion. Of these 11 patients, 5 showed subtle 
clinical manifestations of velocardiofacial syndrome and 
underwent genetic testing due only to the presence of a web. 
All 11 patients were diagnosed with velocardiofacial syn- 
drome. 

The extent and severity of webs vary widely. If the web is 
complete, asphyxiation and death are imminent unless the 
web is recognized and surgically divided. Figure 3.2 shows 
an extensive laryngeal web, but the configuration varies; 





Figure 3.2 Membranous laryngeal web. (See color plate.) 


some are concave posteriorly (illustrated), others are thick, 
and still others are so thin and transparent that vocal fold 
movement can be seen underneath. A small microweb 
near the anterior commissure may go unrecognized for 
years. Microwebs have been implicated as an etiologic fac- 
tor in vocal nodules resistant to behavioral management 
(Bouchayer and Cornut, 2000). These authors suggest that 
in shortening the vibratory length of the vocal folds, the 
microweb results in greater impact stress in the midmem- 
branous portion of the vocal fold thus making the speaker 
more vulnerable to vocal fold trauma. Approximately 75% of 
laryngeal webs occur at the level of the vocal folds, the 
remainder being equally distributed in the supraglottic and 
infraglottic larynx. The voices of infants and children who 
have laryngeal webs vary depending upon the location and 
extent of the latter. When the web is sufficient to impinge on 
vibration, the cry and the speaking voice are abnormal, rang- 
ing from hoarseness to aphonia. During quiet breathing, 
inhalatory stridor will be heard in cases of near-glottal 
occlusion. With lesser webs, stridor and breath impairment 
may be limited to periods when the child engages in vigor- 
ous physical activity. 

Because webs are associated with a broad spectrum of symp- 
toms, they require an equally broad spectrum of treatment 
ranging from observation alone to complex open airway 
surgery (Milczuk, 2000). Most critical in management deci- 
sion-making is to ensure a safe airway while maximizing con- 
servation of respiratory, phonatory, and swallowing abilities. 


Congenital Cysts and Sulcus Vocalis 


Sessile, nonpedunculated, fluid-filled cysts can arise from 
any of the laryngeal soft tissues; however, they almost 
always arise from the laryngeal ventricle, displacing the true 
and false vocal folds and causing glottic and supraglottic 
obstruction. The voice or cry will be feeble or aphonic but 
differentially diagnostic from other obstructions. These 
lesions are potentially remediable. Often, they are not 
noticed until the child develops speech. Bouchayer and Cor- 
nut (2000) have suggested that congenital cysts may rupture 
and leave longitudinal scars or pits (vocal sulci) or mucosal 
bridges that disrupt the propagation of the mucosal wave 
resulting in a dysphonia difficult to remediate through either 
surgery or behavioral management. 


Papilloma 


Laryngeal papillomas are common, benign neoplasms in 
children, have a tendency to recur despite repeated surgical 
removal, and inevitably produce an abnormal voice. Conven- 
tional wisdom suggests they often regress with age, disap- 
pearing during puberty, but in a description of the clinical 
course of 95 patients with recurrent respiratory papilloma 
(RRP) Maknke, Frohlich, Lippit, et al. (1996) state that puberty 
had no effect on the clinical course. The etiology is unknown 
but thought to be viral caused by the human papilloma virus 
(HPV) group. Different types of HPV, particularly types 6 and 
11, have been implicated, which may explain differential 
response to treatment (Rahiknen, Aaltonen, Syrajanen, 1993). 
Morphologically, papillomas are tumor-like, frond-shaped 
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Figure 3.3 Papilloma of the larynx. (See color plate.) 


proliferations of squamous cell epithelia, are exophytic, and 
may have a cauliflower or raspberry appearance (Fig. 3.3). 
These growths frequently extend to include the laryngeal 
ventricles as well as the true vocal folds. When the lesions 
are soft, the vocal folds remain mobile though the mucosal 
wave is likely to be impaired resulting in the dysphonic 
voice. Lesions may spread to the supraglottic and subglottic 
regions and even into the trachea and bronchi. If untreated, 
they can produce asphyxiation and death (Sone and Sato, 
2006). Moreover, Mahnke, Frohlich, Lippert, et al. (1996) 
observed that 4 of 95 RRP patients exhibited malignancy 
degeneration. 

Voice signs and symptoms are severe hoarseness, abnor- 
mal cry, abnormal pith, weak loudness, aphonia, and/or 
wheezy respiratory stridor. 

The RRP lesions may be present at birth but are most often 
detected between the first and eighth years of life, with the 
most common identification occurring between ages 4 and 6 
years when the child has sufficient speech and physical 
activities that the signs and symptoms are exacerbated. 

Because of the recurring nature of this virus, treatment is 
difficult, and to date no single treatment has emerged as a 
reliable solution. Rosen and his colleagues have reported on 
a series of surgical and adjunct medical treatments for papil- 
lomatosis (Bryson Rosen, and Osborne, 2002; Rosen and 
Bryson, 2004; Lee and Rosen, 2004). They reported that 
whereas surgery debulks the tumors, juvenile laryngeal 
papilloma is resistant to cure; the growth generally recurs, 
which may require multiple surgeries and lead to prolifera- 
tion of scarring and or webs. Holland, Koufman, Postma, 
et al. (2002) have proposed that prophylactic treatment of 
laryngoesophageal reflux may reduce the laryngeal soft tis- 
sue complications encountered in surgery for RRP patients. 
Nevertheless, tracheostomy may be required as a temporary 
measure to preserve the airway and, because of this respira- 
tory restriction occurring in the young child, delayed speech 


and expressive language development can be one of its 
sequelae. Adjunctive medical treatments, directed at con- 
trolling the virus and growth of the tumor including cido- 
fovir, indole-3-carbinol (13-C), interferon, photodynamic 
therapy, and vaccines hold considerable promise. Rosen and 
colleagues have demonstrated some success with both 13-C 
and cidofovir but suggest further study regarding long-term 
follow-up, side effects, and efficacy in pediatric patients. 
According to Zietels (Akst, Broadhurst, Burns, et al., 2007), 
phonomicrosurgical microflap resection techniques can 
eradicate adult glottal RRP and appear to be preferable to 
conventional debulking or ablation techniques. Whether or 
not similar success occurs in children is unknown. 


Inflammation 


An extensive group of inflammatory reactions of the vocal 
folds and surrounding tissues can cause aphonia or dyspho- 
nia. 


Acute Laryngitis 


Hoarseness, cough, and sore throats are common signs of 
acute laryngitis caused by viruses, bacteria, or chemical irri- 
tants. Pharyngeal and laryngeal erythema are found on 
examination. When the inflammatory process is sufficient to 
cause increased stiffness, the vocal folds do not vibrate well 
resulting in aphonia or dysphonia of a few days’ duration. 
Laryngitis lasting longer than 10 to 14 days is not normal and 
needs medical attention. 


Croup 


Several inflammatory laryngeal conditions produce a triad of 
hoarseness, barking cough, and inhalatory stridor called 
croup. Acute laryngotracheobronchitis, also known as viral 
croup, is the most common form. Acute epiglottitis, which 
has a similar triad of signs, occurs abruptly and is most often 
caused by Haemophilus influenzae. Diphtheritic croup, 
although now rare, is still another type. Each of these can 
progress to respiratory obstruction and death unless an ade- 
quate airway is maintained. 

In contrast with those with acute laryngitis, croup patients 
have systemic malaise in addition to dysphonia and stridor. 
Acute laryngotracheobronchitis, epiglottitis, and diphtheria 
all cause fever and dysphagia. In diphtheria, the initial respi- 
ratory and phonatory difficulties are caused by the appear- 
ance of a thick, adherent necrotic membrane. Patients who 
survive this stage often develop a toxic neurologic syndrome 
of vocal fold and soft palate paralysis due to vagus nerve 
involvement. Even if the glottic obstruction is overcome, the 
paralysis can extend to the respiratory musculature, requir- 
ing ventilatory support to avoid a fatal outcome. 


Malnutrition 


The cries of malnourished infants strongly resemble those of 
infants with central nervous system disease (see Chapter 9) 
according to Lester (1976), who described the cries of 12 
malnourished infants to be of high pitch, low intensity, and 
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excessively long duration, with an abnormally long latency 
between the first and second cry. 


Down Syndrome 


Down syndrome is one of the most frequently occurring 
chromosomal disorders (Moura, Cunha, Vilarinho, et al. 
2006). The cry of infants with Down syndrome has a flat 
melody, lower than normal pitch, and a tense or strident, 
harsh, rough, pressed, howling, and guttural quality. The 
tenseness or stridency has been attributed to hyperadduc- 
tion of the ventricular as well as true vocal folds during 
phonation (Novak, 1972; Wasz-Hockert, Lind, Vuorenkoski, 
et al. 1968). 

The majority of descriptions of the vocal characteristics of 
individuals with Down syndrome have come from percep- 
tual and acoustic studies. Limited information has been 
reported from visualization of the larynx or vocal tract shape 
necessitating that inferences be made about the etiology of 
the vocal differences. In their study of 66 children with 
Down syndrome, Moura et al. (2008) noted the pheno- 
typic characteristics of Down syndrome children include 
general hypotonia and maxillary hypoplasia with relative 
macroglossia and that these contribute to particular acoustic 
alterations, which they identified as a lower fundamental 
frequency, F(0), with elevated dispersion. The conjunction of 
frequencies for formants, F(1) and F(2), revealed a decreased 
distinction between the vowels, reflecting the loss of articu- 
latory processing. Pentz (1987) also reported that the speak- 
ers with Down syndrome had formant amplitude intensity 
levels that were significantly lower than those of a similar 
group of nonretarded speakers. Interestingly, an earlier 
study done by Moran (1986) comparing perceptual and 
acoustic characteristics of adults with Down syndrome to 
adults without developmental disabilities revealed that lis- 
teners perceived hypernasality as one of the distinctive char- 
acteristics, but when voices were quantified, there was no 
significant difference in formant frequencies or acoustic 
indications of resonance differences. 

The acoustic features that repeatedly have been reported 
as significantly different are frequency perturbation meas- 
ures and spectral noise-to-harmonic component ratios 
(Beckman, Wold, and Montague, 1983; Moran and Gilbert, 
1983; Pentz and Gilbert, 1982). Pentz and Gilbert also 
reported that there were significant correlations between 
judges’ severity ratings and frequency perturbation levels, 
amplitude perturbation levels, and the spectral noise-to- 
harmonic component ratios. It is likely that the vocal tract 
shape differences (Beckman et al, 1983; Moura et al, 2006) 
coupled with motor control problems and differences in 
sidewall reflections from general hypotonia contribute to 
these distinguishing characteristic voice patterns. 

With few exceptions, the majority of voice data come 
from studies of children over 8 years of age and adults. Stud- 
ies of premeaningful vocalizations produced by normally 
developing infants and infants with Down syndrome by 
Smith and Oller (1981) demonstrate that both groups began 
to produce canonical, reduplicated babbling at 8 to 81/2 
months of age, and trends regarding consonantal and vocalic 
development for the two groups were very similar during 


approximately the first 15 months of life: a time when one 
could argue that hypotonia of the two groups might be more 
similar than in later life. 


Trauma 


The larynges of neonates and infants are susceptible to 
external and internal trauma of the blunt and penetration 
variety, as well as that caused by caustic substances inhaled 
or ingested. 


+ Voice Disorders Due to Structural 
Changes in Adulthood 


Chronic Nonspecific Laryngitis 


Dysphonia, vocal fatigue, and cough are produced by chronic 
laryngitis consisting of vasodilation and edema of the vocal 
folds. Persistent epithelial tissue changes occur in which the 
mucosal glands atrophy. The resulting dryness of the mucosa 
is called laryngitis sicca. Laryngologists believe that cigarette 
smoking, air pollution, alcohol, and vocal abuse are contrib- 
utory factors. 


Sjogren’s Syndrome 


Dryness of the larynx may also result from Sjégren’s syn- 
drome (Ogut, Midillik, Oder, et al. 2005). Sjogren’s syndrome 
is an autoimmune disease affecting 4 million Americans, 
most commonly women in their late forties (Sjogren’s Syn- 
drome Foundation, Inc., 2007). Dry eyes, dry mouth, and dry- 
ness in other organs such as the larynx, lungs, and CNS result 
from the body’s immune system attacking its own moisture- 
producing glands. The appearance of the larynx of women 
with Sjégren’s syndrome can range from vocal folds that lack 
luster to vocal folds covered with a thick, tenacious, dry cov- 
ering of mucus. The impact of a dry larynx on voice can be 
severe because it inhibits regular vocal fold vibration, 
decreases vocal efficiency, requiring the speaker to increase 
effort, which may account for the increase in vocal patholo- 
gies in this population. As example, Ogut and colleagues 
reported significant changes in the Reflux Finding Score 
(RFS) and the Reflux Symptom Index (RSI), the objective 
voice quality using Jitter (JITT), Pitch Period Perturbation 
Quotient (PPQ), Shimmer (Shim), Amplitude Perturbation 
Quotient (APQ), and Noise-to-Harmonic Ratio (NHR), and 
the presence of a variety of laryngeal pathologies in this pop- 
ulation. 


Chronic Hypertrophic Laryngitis 


Hypertrophy of the laryngeal mucosa is a common result of 
excess smoking and alcohol. Leukoplakia, a thickening of the 
epithelial and subepithelial layer, is characterized by the 
appearance of white patches on the vocal folds, which can 
be premalignant. Simple chronic laryngitis is manifested by 
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hoarseness, an urge to clear the throat, diffusely red or pink 
mucosa, rounded rather than sharp vocal fold margins, and 
strands of mucus bridging the glottis from one vocal fold to 
the other. Submucosal edema eventually destroys the his- 
toarchitecture of the basement membrane separating the 
epithelium from the underlying structure. The superficial 
layer of the lamina propria, normally a somewhat amor- 
phous space, becomes engorged with fluid; this change is 
called polypoid degeneration, or Reinke’s edema. 

Reinke’s edema is often difficult to differentiate from 
other benign lesions arising from the superficial layer of the 
lamina propria because these so-called mass lesions all 
exhibit fluid accumulation in Reinke’s space and are believed 
to originate from excess internal (smoking, alcohol, reflux) 
or external (vocal abuse or misuse) trauma. 

Current measures of phenotypic expression do not 
always differentiate pathologies arising from the superfi- 
cial layer of the lamina propria. This makes accurate diag- 
nosis difficult and may contribute to why some patients 
with the “same pathology” respond differently to the same 
treatment. Because phenotypic markers typically used in 
the clinic are insufficient to differentiate disorders of the 
lamina propria, Gray and Thibeault have suggested using 
genotypic analyses. 

The rationale for genetic analyses is that genetic informa- 
tion is used by the cells to make proteins. These proteins are 
then used to perform cellular and tissue functions. These cel- 
lular and tissue functions, in turn, determine the structure 
and its biomechanical properties. Gene expression studies of 
vocal lesions such as those done by Gray and Thibeault 
underscore the nature of its importance in increasing our 
understanding of vocal fold dynamics and how it is affected 
by the disease. They have also demonstrated in several small 
studies the role of microarray analysis to find genes of inter- 
est and that it is of paramount importance to increasing our 
understanding of the vocal fold and its diseases. The use of 
DNA microarray technology analysis of global patterns of 
gene expression can reveal unexpected networks of coordi- 
nated regulation in the extra cellular matrix (ECM) of the 
lamina propria. Thibeault, Hirschi, and Gray (2003) demon- 
strated the feasibility of using DNA microarray technology to 
establish transcriptional gene expression patterns for vocal 
fold polyp and vocal fold granulomas. Although in their ini- 
tial report they only had one of each pathology to compare 
with one normal, they identified different gene expression in 
these tissues representing inflammation and wound healing. 

It is well known that gene expression may be influenced 
by various environmental factors. This interaction between 
environment and genes is constantly manifested in vocal 
characteristics and disorders. Thus, Duflo, Thibeault, Li, et al 
(2006) used complementary DNA (cDNA) microarray analy- 
sis to establish distinct gene expression profiles for Reinke’s 
edema and polyps, two phenotypically similar vocal fold 
lesions. They analyzed 11 Reinke’s edema and 17 polyp 
specimens with microarray analysis for 8745 genes and fur- 
ther microarray analysis profiling within groups to identify 
molecular markers for reflux exposure and smoking. Sixty- 
five genes were found to differentiate Reinke’s edema and 
polyps. In Reinke’s edema, 19 genes differentiated for reflux 
exposure, but no genes were found to differentiate smokers 


from nonsmokers. For polyps, no genes were found to dif- 
ferentiate for reflux or smoking. Categorization of unclassi- 
fied lesions was possible with a minimum of 13 genes 
demonstrating not only the feasibility of benign lesion clas- 
sification based on microarray analysis but also its usefulness 
for improving diagnosis potentially generating prognostic 
indicators and targets for therapy. 

Thibeault, Gray, Li, et al. (2002) related gene expression in 
benign vocal lesions to phenotypic marker of stiffness deter- 
mined from perceptual judgments of stroboscopic images. 
They determined there was a clear relationship between 
genotypic expression found in polyps and Reinke’s edema. 
Polyps, characterized by stiffer mucosal waves, had higher 
levels of gene expression, whereas stiffer mucosal wave 
scores for Reinke’s edema were associated with lower gene 
activity levels. 

Authorities are not in total agreement about the etiology 
and differences in appearance of vocal nodules (Fig. 3.4) and 
vocal fold polyps (Fig. 3.5). Both are mass lesions with fluid 
accumulation in Reinke’s space. Nodules are generally bilat- 
eral and symmetric, whereas polyps are more often unilateral 
but may cause irritation on the contralateral fold creating an 
appearance similar to that of nodules. Because differences 
are often difficult to determine from indirect or stroboscopic 
examination, Rosen and colleagues have suggested the term 
podule. Thibeault (2006) has demonstrated that while phe- 
notypically they may appear similar, nodules and polyps are 
genetically dissimilar. 

Critical to normal vocal fold vibration is the composition 
of the extracellular matrix of the lamina propria. Under- 
standing the underlying molecular and genetic regulation 
and homeostasis of the ECM are essential to development of 
new Clinical interventions meant to restore normal vibratory 
properties (Thibeault, 2005). Moreover, because the end 
result of the gene-environmental interaction is the produc- 
tion of cellular and tissue proteins, the field of proteomics 
holds considerable promise for helping us understand etiol- 
ogy of voice disorders and developing improved treatment 
(Gray and Thibeault, 2002). 


Cricoarytenoid Arthritis 


Rheumatoid arthritis (RA) is a systemic disease of connec- 
tive tissue that affects joints lined with synovial membrane. 
According to Tarnowska, Amernik, Matyja, et al. (2004), 
cricoarytenoid joint involvement is found in 27 to 78% suf- 
fering from RA. Hoarseness and stridor, as well as throat 
pain during phonation, during swallowing, and from medial 
compression of the thyroid cartilage radiation to the ears, 
can be signs of RA. In the acute phase of the disease, patients 
complain of burning and foreign body sensation in the 
throat, hoarseness, pain on speaking, voice fatigability, and 
problems with swallowing. Laryngoscopy shows edema or 
redness, or both, of the mucosa surrounding the cricoary- 
tenoid joints, and reduced or absent cricoarytenoid joint 
and vocal fold motion unilaterally or bilaterally. Patients 
with RA or ankylosing spondylitis and cricoarytenoid 
arthritis can become dyspneic because of vocal fold 
fixation and can require tracheostomy and arytenoidec- 
tomy. At least one such patient experienced a relief of 
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Figure 3.4 (A) Bilateral vocal nodules. (See color plate.) (B) Bilateral polypoid vocal folds and contact ulcer on 


left vocal process. 


symptoms after corticosteroid therapy (Bienenstock and 
Lanyi, 1977). 

Laryngeal involvement in RA is not uncommon and may 
include cricoarytenoid arthritis or vocal fold lesions such as 
vocal fold rheumatoid nodules or bamboo nodes (Speyer, 
Speyer, and Heijnen, 2006). These investigators used 
quality-of-life questionnaires to study the prevalence and 
the relative risk of dysphonia when suffering from RA com- 
pared with that of healthy subjects in 166 subjects with 
rheumatic arthritis and 148 healthy control subjects. RA 
patients had statistically significant higher prevalence and 





Figure 3.5 Sessile polyp of the vocal fold. 


relative risk of dysphonia; RA patients varied between 12% 
and 27%, and the healthy subjects showed prevalence data 
varying from ~3 to 8%. 


Malignant Tumors 


According to English (1976), ~80% of all laryngeal tumors 
are malignant (although this estimate may be low), occur 
most often between ages 50 and 70, are more common in 
men than in women, and are causally linked to cigarette 
smoking, alcohol, and other sources of chronic laryngeal tis- 
sue irritation. The most common malignant lesion by far is 
squamous cell carcinoma (Fig. 3.6). Adenocarcinoma, the 
next most common malignancy, is rare. The relative fre- 
quency of squamous cell carcinoma according to site of 
lesion is shown in Fig. 3.7. Approximately 50% originate on 
the true vocal folds and spread to structures beyond. Within 
the vocal folds themselves, 75% originate from the anterior 
half of the true vocal fold, or near the anterior commissure. 
Hoarseness is initially the only sign of laryngeal cancer 
when it involves the true vocal folds. Lesions in the ventricle 
or supraglottal area may not present with hoarseness. A 
lump or pain in the throat may indicate spread. Inhalatory 
stridor from obstruction of the airway is most often a very 
late sign. Throat pain, shortness of breath, difficulty in swal- 
lowing, a mass in the neck, weight loss, ear pain, cough, and 
fetid breath also occur, depending on the growth pattern of 
the tumor. 

Early detection of laryngeal carcinoma cannot be overem- 
phasized. Because hoarseness is its only early sign, its 
presence for more than 6 weeks should arouse suspicion of 
cancer until proved otherwise. If discovered early, the cancer 
may be kept from invading surrounding tissues supplied by 
the lymphatic system and thereby metastasizing to other 
organs. 
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Figure 3.6 Carcinoma of the larynx. (See color plate.) 





Figure 3.7 Relative frequency of site of origin of squamous cell carci- 
noma of the larynx. (From English, G.M. [1976]. Malignant neoplasms 
of the larynx. In English, G.M. [Ed.], Otolaryngology. New York: Harper 
& Row.) 


Endocrine Disorders 
Hypothyroidism 


Hypothyroidism is caused by insufficient secretion of thy- 
roxin by the thyroid gland. The term hypothyroidism is used 
in reference to the hormonal deficit when the disease is mild 
and myxedema in reference to the physical changes that 
appear when the disease is severe. Its onset in adults is insid- 
ious, taking months or years to develop; because of this 
delay, missed diagnosis is common. Lethargy, intolerance to 
cold, puffiness around the eyes, dryness of the skin, and loss 
of frontal hair are classic physical signs. In more severe 
hypothyroidism, the central nervous system may be affected, 
particularly the cerebellum, resulting in ataxia and ataxic 
dysarthria. As hypothyroidism becomes severe, the clinician 
may find intellectual, language, and personality changes. As 
a consequence, such patients are occasionally mistaken as 
demented from primary central nervous system disease. 

The dysphonia of hypothyroidism is characteristically 
hoarse, sometimes described as coarse or gravelly, and of 
excessively low pitch. These voice signs are typical effects of 
mass loading of the vocal folds by their infiltration with 
myxomatous material. The importance of the dysphonia in 
the detection and diagnosis of hypothyroidism is illustrated 
by the following histories. 


Case Study 3.1 


A 67-year-old widow from Chicago asked her orthopedic 
surgeon if she could see someone about her voice. She 
had last been examined 6 months previously after hip 
joint surgery. One of her complaints was that she felt tired 
and fatigued and her voice had become hoarse or husky. 
Her vocal folds and remaining laryngologic examination 
were normal, and a diagnosis of “functional” voice disor- 
der was made. 

Interview with the patient about her voice revealed that 
it was hoarse most of the time and worse when tired. On 
voice evaluation, she was mildly hoarse and the pitch of 
her voice low, even after her age was taken into consider- 
ation. She volunteered that she found herself sleeping 
more than usual—3 or 4 hours a day—in addition to 8 hours 
at night, was chilly much of the time in her apartment, 
and was keeping her thermostat high. She complained 
that her skin was dry. A neurologist tested her Achilles’ 
reflexes and found that her muscles were slow in return- 
ing to a relaxed state. He ordered an evaluation of her 
total serum thyroxine level and found that it was abnor- 
mally low. He made a diagnosis of hypothyroidism and 
instituted thyroid hormone replacement therapy. 


Case Study 3.2 


After surgery for rectal carcinoma, a 57-year-old male ele- 
mentary teacher complained of cold intolerance in his 
feet and hands, loss of sensation in his feet-as if there 
were “pads” under his soles—and weakness of the lower 
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extremities. These symptoms progressed over a 13-year 
period from the time of his surgery. He complained of diffi- 
culty in climbing stairs and arising from a squatting position 
and inability to lift modest weights. Although the weakness 
varied from day to day, there was no daily fluctuation, and 
the weakness was not related to amount of physical exer- 
tion. He had an unsteadiness of gait-a tendency to stagger 
more to the left than to the right-and noted loss of hair 
over his lower extremities, loss of sweating, andimpotency. 

Neurologic examination showed minimal distal sensory 
loss with moderate proximal weakness. Deep tendon 
reflexes were mildly decreased and were slow. There was a 
slowing and an irregularity of rapid alternating move- 
ments and a mild unsteadiness of gait. Nerve conduction 
studies confirmed mild sensorimotor neuropathy and 
mild proximal myopathy. 

General physical examination revealed dry coarse skin, 
sparse hair, and puffy facies. He had periorbital edema, 
dry skin, scaling, and decreased deep tendon reflexes with 
a slow return phase. 

Although the patient and family were not aware of any 
change in the patient’s speech, the pitch of his voice was 
hoarse and deep, particularly when he was asked to sing 
down to the bottom of his pitch range. Articulation 
was performed with irregular articulatory imprecision 
brought out by alternate motion rate testing of [p^], [t^], 
and /k*/. The tip and blade of his tongue protruded 
between the teeth for the production of /s/ and /z/ 
sounds. The patient was facetious throughout the exami- 
nation, making jokes about questions put to him concern- 
ing his illness, particularly his speech, which he said was a 
source of humor in the classroom. 

On laryngologic examination, his vocal folds were 
myxedematous. 

Adiagnosis of advanced myxedema with myxedematous 
dysphonia, associated ataxic dysarthria, and lingual articu- 
latory imprecision due to edema of the tongue was made. 

In addition to the more typical myxedematous dyspho- 
nia, the clinician needs to be alert to distortions of articula- 
tion, particularly anterior lingual sounds due to increased 
tongue size caused by accumulation of myxomatous 
material in the tongue, and to ataxic dysarthria. 


Hyperthyroidism 


The dysphonia associated with hyperthyroidism has not 
been carefully described. Breathy voice quality and reduced 
loudness have been noted, presumably due to weakness of 
the respiratory and phonatory musculature. Associated with 
the dysphonia are anxiety, irritability, and fatigability. 


Hyperpituitarism 


A tumor in the region of the pituitary gland can cause over- 
secretion of the pituitary growth hormone somatotropin, 
producing an untimely enlargement of bone, cartilage, and 
soft tissue. The result is a condition known as acromegaly. 
When the condition begins in childhood, before epiphyseal 
fusion, gigantism results, in which the patient grows to an 
abnormally great height. 


The dysphonia of acromegaly is an excessively low pitch 
and hoarseness, the consequences of enlargement of the 
vocal folds and laryngeal cartilage. The reduced fundamental 
frequency of the voice is augmented by enlarged pharyngeal 
and oral cavities. Because the tongue becomes enlarged and 
the mandible prognathic, articulation defects are common. 


Amyloidosis 


Systemic amyloidosis is an uncommon disease in which an 
abnormal fibrous protein, amyloid, infiltrates the extracellu- 
lar compartments of connective tissue. The larynx is an 
occasional site for the localized expression of the disease. 
Laryngeal amyloidosis occurs as an atypical laryngeal nodule 
or subglottic stenosis. It is usually misdiagnosed as a vocal 
nodule or idiopathic subglottic stenosis. Patients with laryn- 
geal amyloidosis have breathy, strained breathy, or hoarse 
voices (Aronson, Burroughs, Duffy, et al., 1991). 


Virilization 

Virilization refers to the development of male sex characteris- 
tics in females. For example, in the condition known as per- 
verse mutation, the larynx grows to a larger-than-normal size 
during sexual maturation in the female because of excess 
secretion of the androgenic hormones. The voice is of abnor- 
mally low pitch. Females treated for menopausal symptoms 
with drug preparations containing androgen often develop vir- 
ilization and low voice pitch and hoarseness in addition to 
facial hair. Damsté (1967) also noticed falsetto pitch breaks. 
Like myxedema, the dysphonia of virilization can develop 
insidiously and remain unnoticed. Estrogen or testosterone can 
produce laryngeal edema, causing reduced pitch (Gould, 1972). 

Because the conditions are relatively rare, most of what we 
have learned about virilization comes from case histories. 
Kim, Kim, Kim, et al. (2006) reported on a 14-year-old girl and 
her younger sister, who presented with primary amenorrhea, 
deepening of the voice, and clitoromegaly. Through genetic 
analysis, they were able to determine the girls had 5 alpha- 
reductase deficiency. There was no evidence of genetic abnor- 
mality in the eight screened exons of the androgen receptor 
gene. Holt, Medbak, Kirk, et al. (2005) presented a case of a 
28-year-old woman with virilization occurring in two succes- 
sive pregnancies. In her case, differential diagnoses included 
ovarian disease and fetal aromatase deficiency. She presented 
during the third trimester of her first pregnancy with rapid 
onset of hirsutism, increased musculature, and deepening 
voice. A blood hormone profile revealed significant hyperan- 
drogenism (testosterone, 72.4 nmol/L; normal range, 0.5 to 
3.0). She delivered a normal boy, and maternal androgen con- 
centrations returned rapidly to normal. She presented 2 years 
later, during her second pregnancy, with similar symptoms 
and biochemistry. Again, she delivered a healthy normal boy, 
and androgens returned immediately to normal. 

A case of virilizing adrenocortical adenoma with Cush- 
ing’s syndrome, thyroid papillary carcinoma, and hypergas- 
trinemia in a middle-aged Japanese woman was reported 
by Fukushima, Okada, Tanikawa, et al. (XXX). This 45-year- 
old woman had a 10-year history of amenorrhea, hyper- 
tension, and diabetes. Physical examination showed a 
masculinized woman with severe hirsutism, male-like 
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baldness, deep voice, acne in the precordia, and clitorism. 
After resection of the adrenocortical adenoma, preopera- 
tive high testosterone and cortisol levels decreased to the 
normal range. 

An opposite change is desired in anthropometry with 
testosterone therapy in females with gender identity disor- 
der as detailed in a case study by Yamasaki, Douchi, and 
Nagata (2003). They report on a 31-year-old, regularly men- 
struating Japanese female diagnosed as having gender iden- 
tity disorder. Treated with 125 mg testosterone enanthate, 
intramuscularly, every 2 weeks for 4 months, her serum 
testosterone levels increased to the normal male value (from 
28 to 432 ng/dL) and her voice became lower despite the fact 
that her menstrual cycle remained regular. 

Use of anabolic steroids may also have a virilization side 
effect as reported from interview data collected from ten 
women athletes by Straus, Liggett, and Lanese (1985). The 
women perceived that their cyclical use of anabolic steroids 
resulted in increased muscle size and strength, deepening 
of the voice, increased facial hair and aggressiveness, cli- 
toral enlargement, and menstrual irregularities. According 
to Strauss, Liggett, and Lanese these female athletes were 
willing to tolerate these side effects, though they recog- 
nized the side effects might be unacceptable to other 
women. 


Feminization 


Case studies have also enlightened us about voice disorders 
associated with hormonal problems in males. Akcam, Bolu, 
Merati, et al. (2004) described the voice characteristics of 24 
males with isolated hypogonadotropic hypogonadism (IHH) 
before and after androgen therapy. Prior to therapy, the 
mean fundamental frequency (Fo) was 229 + 41 Hz, which 
after treatment decreased to 173 + 30 Hz approaching the 
normal male Fo. Males with IHH fail to undergo normal sex- 
ual development, including the lack of masculinization of 
the voice, with testosterone treatment the voice lowers to 
near normal male values. 


Hormones and Menstrual Cycle 


Several studies indicate that voice may be affected by the 
menstrual cycle; hoarseness and reduced pitch may be 
caused by vocal fold edema (Smith, 1962). Singers and 
actresses claim to be severely affected by these periodic 
episodes of dysphonia, which interfere with their perform- 
ances. Minor changes in voice production, such as voice 
onset time (VOT), may be no less significant. Differences in 
VOT, frequency perturbations, vocal loading, and laryngeal 
appearance during different phases of the menstrual cycle 
have been the focus of recent studies. Recognizing that VOT 
may be sensitive to changes in hormones, which may affect 
the neuromuscular systems involved in speech production, 
Whiteside, Hanson, and Cowell (2004) documented effects 
of menstrual cycle phase on VOT characteristics in naturally 
timed speech using whole words. Their findings document 
that ovarian hormones play some role in shaping some tem- 
poral components of speech. Abitbol (2004) has suggested it 
is imperative that clinicians know where females with vocal 


fold lesions are in their menstrual cycle when making man- 
agement decisions because he has documented that the 
appearance of lesions change in predictable manners in con- 
cert with the menstrual cycle. This notion is supported by a 
study of classical singers examined before and outside 
menses. Chernobelsky (2002) made two clinically important 
conclusions: (1) premenstrual hyperchanges were a result of 
combined abuse-and menses-related influence on the lar- 
ynx, and (2) laryngeal examinations before menstruation 
might be a useful test for the presence of physiologically cor- 
rect singing. 

In their scholarly review of hormone influence on voice 
Amir and Biron-Shental (2004) state that, “Sex hormone 
fluctuations were shown to affect female vocal folds and 
laryngeal function. Laryngeal changes are evident 
throughout the span of life, starting at puberty with the 
arousal of the hormonal system, fluctuating systemati- 
cally during the reproductive years with the menstrual 
cycle, and then changing again with the decline of hor- 
monal activity at menopause. Early studies that explored 
this relation were based merely on subjective impressions 
of voice quality, recent studies have used more objective 
tools for examining this relation, including histologic 
observations, stroboscope, electroglottography (EGG), and 
computerized acoustic analyses. In these studies, the lar- 
ynx was shown to be a hormonal target organ and, as such, 
sex hormones affect its morphology, histology, and func- 
tion, similar to their effect on the genitals and other 
organs. Examining the relation between sex hormones 
and the larynx could assist in understanding the mecha- 
nisms of voice production, and it could provide the clinician 
with supplemental diagnostic information on different 
medical conditions.” 

Because endocrine fluctuations impact voice, the possi- 
bility that use of the contraceptive pill might stabilize vocal 
quality by “dampening” hormonal fluctuations has recently 
been studied in nonprofessional voice users and in classi- 
cally trained voices. In a series of studies conducted by 
Amir and colleagues (Amir and Kishon-Rabin, 2004; Amir, 
Biron-Shental, Muchnik, and Kishon-Rabin, 2003; Amir, 
Kishon-Rabin, and Muchnik, 2002), it has been clearly 
demonstrated that oral contraceptives have no adverse 
effect on voice quality, but, in effect, most acoustic meas- 
ures show improved voice quality among women who used 
the birth control pill. They postulate that the differences in 
the noise indices between groups may also shed light on the 
nature of the effect of sex hormones on vocal fold activity. 
They further suggest that hormonal fluctuations may have 
more of an effect on vocal fold regulation of vibration than 
on glottal adduction. As example, Amir, Biron-Shental, and 
Shabtai (2006) reported two studies of nonprofessional 
voice users using birth control pills. In the first study, the 
researchers obtained voice samples from 30 women with 
no history of voice training, who used pills with different 
progestins (drospirenone, desogestrel, gestodene), and 10 
women who did not use the pill. Comparison of recordings 
did not reveal acoustic differences in sustained phonation of 
vowels across the pill groups and controls. In the second 
study, the authors conducted a meta-analysis using results 
from their first study jointly with results from three studies 
reported in the literature, which used similar methodologies. 
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Results of this meta-analysis indicated that pill users exhib- 
ited lower jitter and shimmer values on sustained vowels, 
whereas no difference of fundamental frequency was 
observed among women who use the pill. These results 
support the notion that not only is there no adverse effect 
on voice with new-generation monophasic birth control 
pills, but also the acoustic properties of the voice, which are 
reflected in perturbation measures in sustained vowels, 
actually may be improved among women who use the pill. 

A parallel approach recording voice acoustics of 24 profes- 
sional voice users using oral contraceptive pills was taken by 
Van Lierde, Claeys, De Bodt, et al. (2006). Objective (e.g. Aero- 
dynamic, voice range profile, acoustic, Dysphonia Severity 
index [DSI], and nasometry) and Subjective (e.g. Perceptual 
evaluation of voice and nasal resonance) were assessed when 
hormonal levels reached a steady state (between 10th and 
17th day following gestation of pill) and when hormonal lev- 
els were minimized. They found no significant differences and 
concluded that oral contraceptive pills do not have an impact 
on the objective and subjective voice or resonance parameters 
in young professional voice users, at least as measured in this 
study. Analogous findings were reported in the first double- 
blind, randomized, placebo-controlled trial of professional 
classically trained singers by (2006). In fact, results from La, 
Ledger, Davidson, Howard and Jones (2007) double blind, ran- 
domized placebo controlled trial, designed to assess the 
effects of the contraceptive pill on patterns of vibration, sug- 
gest a positive effect on performance of Western classical 
singing repertoire. Use of the contraceptive pill, drospirenone 
(Yasmin, Schering AG, West Sussex, UK) with antiandrogenic 
and antimineralocorticoid properties, reduced the irregularity 
of the vibratory pattern in highly trained classical singers. 

The picture appears to change when premenstrual syn- 
drome is taken into account. Chae, Choi, Kang, (2001) evalu- 
ated the relationship between voice change and premenstrual 
syndrome (PMS) in positive PMS and negative PMS groups. 
There were no significant differences during menses phases 
in the negative PMS group, but perturbation measures were 
increased during the premenstrual phase compared with 
the follicular phase in the positive PMS group suggesting 
more careful voice habituation is required during the pre- 
menstrual phase in this group. 


Trauma 


Diffuse laryngeal trauma causes dysphonia or aphonia 
when the vocal folds and surrounding area are damaged by 
(1) blunt or penetrating injuries from neck injuries such as 
when being choked or in automobile accidents in which 
the victim is thrown forward, striking the larynx against 
the dashboard of the car and causing compression frac- 
ture; (2) penetrating wounds of the larynx from gunshot 
wounds or stabbings; (3) striking of the neck against wires 
and cables during skiing, snowmobiling, motorcycling, and 
water-skiing accidents; (4) biomechanical vibratory 
trauma; or (5) laryngeal esophageal reflux. Edema, 
hematoma, fractures, dislocations, lacerations, and paraly- 
sis are simultaneous tissue reactions to the first three of 
these injuries. The laryngeal surgeon attempts to restore 
damaged tissue to its normal configuration to allow 
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Figure 3.8 Sketch of laryngeal trauma showing anteroposterior 
laryngeal compression of cartilages and soft tissues. 


phonation, but the initial consideration is maintaining a 
patent airway. Once out of danger, the patient’s deglutition 
and voice become of concern. The voice, swallowing, and 
airway compete for whatever degree of functional-struc- 
tural restoration is possible. 

Laryngeal lesions due to trauma can be classified into 
supraglottic, glottic, and subglottic types. The glottic form 
most often results in dysphonia or aphonia, because in this 
type thyroid cartilage fracture in the anterior commissure 
area displaces fragments of cartilage posteriorly, lacerating 
the vocal folds and causing edema (Fig. 3.8). 

Trauma and vocal fold paralysis can occur from laryngeal 
intubation for ventilatory support during surgeries especially 
if the endotracheal tube is too large for the patient’s airway 
(Fig. 3.9) or if the patient has predisposing conditions of 
laryngeal esophageal reflux, vocal abuse, or a disproportioned 
airway space to the arytenoid cartilage. Mucosal ulceration 
may lead to granuloma. Edema, mucosal laceration, and abra- 
sion are temporary. Granulomas may be self-remediated or 
stubbornly resist treatment and tend to recur even after surgi- 
cal removal or steroid injection. Glottal strictures can result 
from untreated granulation. Dislocation of the arytenoid 





Figure 3.9 Postendotracheal anesthesia granulomas from trauma. 
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cartilages may also result from laryngeal intubation. Dyspho- 
nia or aphonia are inevitable in such cases, depending upon 
the severity of the trauma. Posterior cricoarytenoid injury can 
result from nasogastric intubation and can mimic recurrent 
laryngeal nerve palsy. Differential diagnosis is most often 
made through laryngeal electromyography (EMG). 

Prolonged endotracheal intubation can produce serious dis- 
orders of phonation due to traumatic irritation, edema, com- 
pressive effects, inflammation, ischemia, and loss of vocal fold 
mass. Scarring, granulomas, ulcerations, macerations, forma- 
tion of membranous fibrous tissue on the arytenoid cartilage 
or within the interarytenoid space, cricoarytenoid subluxa- 
tion, ankylosis, vocal fold paralysis, and subglottal stenosis can 
be produced by excessive endotracheal tube size, tube tip irri- 
tation, cuff pressure, and the natural curvature of the endotra- 
cheal tube. In a study of 17 patients, ages 17 to 85 years, who 
had been intubated for periods ranging from 10 hours to 4 
weeks, Gallivan, Dawson, and Robbins (1989) found a total of 
31 lesions in 16 of the 17 patients, 77% involving the vocal 
folds, 13% the arytenoid cartilage, and 10% the interarytenoid 
notch and lumen between the vocal folds. Phonation, respira- 
tion, and coughing were compromised in these patients; voice 
impairment was virtually universal. 

Endotracheal intubation also can cause vocal fold paralysis 
(Ellis and Pallister, 1975; Hahn, Martin, and Lillie, 1970; 
Holley and Gildea, 1971; Yamashita, Harada, Ueda, 
et al., 1965). Lim, Chia, and Ng (1987), in a report of three 
patients who developed right vocal fold paralysis after endo- 
tracheal intubation during surgery unrelated to the neck, 
found severe hoarseness and the right vocal fold paralyzed in 
the paramedian position up to 6 weeks after surgery in two 
patients, and only slight voice improvement 2 months after 
intubation in a third patient. In all three patients, voice 
improvement was due to compensation by the unimpaired 
vocal fold. It is speculated that the cause of vocal cord paral- 
ysis occurs from pressure arising from an overinflated cuff of 
the endotracheal tube compressing the peripheral anterior 
branches of the recurrent laryngeal nerve. 

Caustic substances swallowed by children (and often by 
adults who attempt suicide) that cause grave damage to the 
larynx and esophagus are lye, ammonia, sodium hypochlorite 
(Clorox), and orthophenylphenol (Lysol). Thermal traumas 
are caused by ingestion of hot liquids or solids or inhalation of 
hot gasses. Irradiation burns of the larynx occur as a side 
effect of the treatment of malignancies of the neck. Techni- 
cally, vocal nodule and contact ulcer might be included, for 
they too are lesions secondary to trauma and caused by vocal 
abuse; however, they are discussed in separate sections. 


Esophageal Reflux and Granulomas 


Esophageal reflux, or gastroesophageal reflux (GER), the regur- 
gitation of hydrochloric acid from the stomach, is an internal 
caustic substance that has been implicated in vocal fold irrita- 
tion, contact ulcers, and dysphonia. Inquiry into daytime and 
nocturnal acid reflux is an important component of the voice 
disorder history in patients with or without observed contact 
ulcer or contact ulcer granuloma. In a study of 32 patients sus- 
pected of having esophageal reflux, Koufman, Winer, Wu, et al. 
(1988) found either intermittent or chromic dysphonia in 
65.6% of the patients. In one subgroup of five patients who pre- 


sented with chronic hoarseness, examinations revealed ery- 
thema and edema of the posterior larynx, laryngeal granulo- 
mas, and bouts of hoarseness related to reflux history. 

Laryngeal granulomas, as discussed elsewhere in this text, 
are presumed to be related to four predisposing factors: 
vocal abuse, esophageal reflux, orotracheal intubation, and 
morphologic structure (Pontes, DeBiase, Kyrilos, et al., 2001). 
Knowledge of the presence of acid reflux in a majority of 
chronic hoarseness is not new (Wiener, Koufman, Wu, et al, 
1989). Because it it common and symptoms differ gastroe- 
sophgeal reflux (GER), hoarseness treatment should include 
some focus on managing reflux (Toohill, Kuhn, 1997; 
deLima-Pontes, De Biase, Gadelha, 1999; Emami, Morrison, 
Rammage, et al., 1999; Havas, Priestly, Lowinger, 1999; 
Cohen, Back, Postma, et al., 2002), in combination with 
behavioral management of the voice disorder. 

Because granulomas tend to recur after surgical removal, 
Havas and colleagues have stated that, “the only indications 
for laryngeal surgery are to resolve diagnostic doubt or to 
treat airway compromise.” Alternatives to surgical interven- 
tion of granulomas are treatment with application of topical 
inhalant steroids (Row, Goh, Chon, et al., 1999) or treatment 
with intralaryngeal injection of botulinum toxin (Orloff and 
Goldman, 1999). Treatment with Botox (botulinum toxin) 
causes a temporary paresis of the vocal folds providing for a 
window of time during which the vocal process can heal 
and the granulomas resolve because it is not exposed to 
ongoing intermittent contact and friction with the opposing 
arytenoid. 


Bowing of the Vocal Folds 


Bowing of the vocal folds is a term that describes an elliptical 
glottal shape during phonation due to failure of adequate 
anteroposterior tension of the vocal folds, laryngeal scarring 
as in vocal sulci, or to loss of tissue mass. Instead of the 
membranous portions of the folds adducting with straight 
vocal edges, they are curved. The intercartilaginous portions 
or the vocal folds are adducted; only the membranous por- 
tions fail to adduct. Causes of bowing are (1) atrophy of the 
thyroarytenoid muscle owing to denervation; (2) bilateral 
cricothyroid muscle weakness due to superior laryngeal 
nerve lesions resulting in failure of anteroposterior stretch- 
ing of the vocal folds; (3) loss of vocal fold tissue in the aging 
larynx; (4) iatrogenic lesions; (5) idiopathic lesions; and (6) 
infraglottic edema extending from the anterior commissure 
to the posterior larynx creating a pseudosulcus and bowed 
appearance. 

Not all vocal fold bowing is due to structural changes; it 
can occur in patients with psychogenic voice disorders and 
has been observed in the hypokinetic dysarthria of Parkin- 
son’s disease (Hanson, Gerratt, and Ward, 1984) and in the 
flaccid dysphonia of myasthenia gravis (Neiman, Mountjoy, 
and Allen, 1975). 

Regardless of the etiology, the associated dysphonia is 
breathiness and, in extreme cases, aphonia (LeJeune, Guist, 
Samuels, 1983; Kahane, 1987; Koufman, 1986; Tucker, 1985). 
Bowing of the vocal folds has also been observed in the hypo- 
kinetic dysarthria of Parkinson’s disease (Hanson, Gerratt, and 
Ward, 1984) and in the flaccid dysphonia of myasthenia 
gravis (Neiman, Mountjoy, and Allen, 1975). 
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¢ Summary 


Voice disorders caused by structural changes in infants 
arise from congenital laryngeal malformations, neo- 
plasms, inflammation, trauma, or malnutrition. 


Voice disorders caused by mass lesions in children and 
adults stem from inflammation, neoplasms, endocrine 
disease, or trauma. 
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Chapter 4 


Nasal Resonatory Disorders 


For when we wish to emit voice or breath through the 
mouth only, which may happen in the pronunciation of the 
letters which I shall name explosives, we shut the passage of 
the nostrils, as with a valve. 


—Amman, 1700 


¢ The Velopharynx in Life 


The isthmus between the oral and nasal cavities must be 
closed during swallowing and speech or else food will be 
regurgitated out the nares, vowels will be excessively res- 
onated in the nasal chambers, pressure consonants will 
lose their characteristic explosive and friction noises, and 
air will be audibly emitted through the nares. In Chapter 1, 
we noted that phonation was a phylogenetically recent 
acquisition among vertebrates and that the vital functions 
of the larynx were respiratory and protective of the airway. 
Much the same can be said about velopharyngeal function. 
The velopharyngeal port was not designed primarily as a 
means of coupling or uncoupling the oral and nasal cavi- 
ties for speech. Rather, velopharyngeal closure exists pri- 
marily to: 


1. Seal off the nasal from the oral cavities to isolate the 
oropharyngolaryngeal tract from atmospheric pressure 
during deglutition, producing a partial vacuum to facili- 
tate compression of the food bolus by the tongue, 
cheeks, and pharynx, and thereby forcing it into the 
esophagus 


2. Open the eustachian tube during swallowing to ventilate 
the middle ear 


In only three phonemes of English, the nasal semivowels, is 
it permissible for the velopharyngeal valve to remain com- 


pletely open, coupling the oral with the nasal cavities: /m/, 
/n/, and /n/ (as in sing). The remainder of English phonemes 
are correctly produced with the velopharyngeal port closed, 
meaning all vowels and most consonants. If the port is left too 
open, voiced consonants will be excessively resonated in the 
nasal cavities, and intraoral air pressure for plosives, frica- 
tives, and affricates will be reduced. In an attempt to produce 
them, air escapes through the nares and results in a weak, 
nasally distorted production of the target sound. It should 
also be mentioned that a certain amount of nasal resonation 
of vowels is both normal and desirable. The borderline 
between normal and pathologic hypernasality often is a mat- 
ter of perceptual preference. This is most notable in different 
regional dialects some of which have increased nasal reso- 
nance as one of the distinguishing characteristics. Moreover, 
resonance balance is the focus of many voice therapy tech- 
niques designed to improve vocal quality and reduce laryn- 
geal tension. 

What must be remembered is that the nasal resonatory 
defect of hypernasality is not the only one produced by 
velopharyngeal insufficiency. Hypernasality may be the pri- 
mary one. Secondary or associated defects are excess nasal 
airflow, nasal emission articulatory distortions owing to loss 
of intraoral air pressure stolen by the nasal air leak, compen- 
satory articulatory and phonatory substitutions many of 
which are nonphonemic, articulatory distortions due to den- 
tal and palatal defects, and even dysphonia from compensa- 
tory vocal abuse. 

In medical and speech-language pathology practice, 
hypernasality and nasal air emission and hyponasality have 
similar diagnostic medical implications as dysphonia from 
laryngeal disease. That is, they signify structural, neurologic, 
or psychiatric pathologic conditions and therefore are essen- 
tial to medical differential diagnosis, as well as having devel- 
opmental, educational, social, psychologic, and occupational 
handicaps that require treatment. The following transcript of 
a deposition illustrates these many factors involved in 
velopharyngeal insufficiency. 
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Case Study 4.1 


Attorney: “Have you ever given a deposition before?” 

Clinician: “No, this is my first.” 

A: “Let me explain that a deposition is an official tran- 
script of testimony in place of your appearance in court. 
We are the attorneys for Ms. Alquist, who is suing the sur- 
geon who removed her tonsils and adenoids. She claims as 
a result of that surgery her speech was so impaired that 
she was placed on indefinite leave of absence without pay 
from her teaching position. She also claims that when she 
swallows, fluids leak out her nose. So, we are taking testi- 
mony from specialists who have independently investi- 
gated Ms. Alquist’s complaints. | understand you are the 
speech pathologist who evaluated Ms. Alquist’s speech, 
and that the plastic surgeon and otolaryngologist who also 
saw her will be following you. This is the official stenogra- 
pher who will take down your testimony. You will have a 
chance to read the transcript and make corrections before 
itis put into final form. Some of the questions | will ask may 
seem oversimplified, but remember, this testimony will go 
before a jury of lay individuals who have no technical 
knowledge of the subject. You have already stated your full 
name and address. What is your occupation?” 

C: “lam a speech-language pathologist.” 

A: “I notice you are addressed as ‘doctor.’ Are you a 
medical doctor?” 

C: “No. | hold a Ph.D. in speech-language pathology.” 

A: “Are you licensed to practice speech pathology?” 

C: “In this state there are no licensure laws for speech 
pathologists.” 

A: “Will you tell the court what a speech pathologist is 
and does?” 

C: “We identify, describe, diagnose, and treat people 
who have disorders of communication, that is, voice, 
speech, and language disturbances from organic or psy- 
chiatric illnesses. Where I work, speech pathologists con- 
sult to physicians of all specialties.” 

A: “Thank you. Now, do you know the plaintiff, and if so 
on what occasion have you conferred with her?” 

C: “I evaluated Ms. Alquist’s speech on May 21 of last 
year. That evaluation took approximately one and one- 
half hours.” 

A: “Are these the clinical notes that you wrote as a result 
of your evaluation?” 

C: “Yes, they are.” 

A: “Are these the audiotape and videotape recordings 
you made of the patient during your evaluation?” 

C: “Yes, they are.” 

A: “When you evaluate a patient’s speech, doctor, what 
exactly do you do?” 

C: “l ask the patient to perform different tasks that will 
provide me with an adequate sample of the patient’s 
speech. | usually tape record the speech for more detailed 
analysis later. | might also make certain laboratory tests, 
such as aerodynamic and spectrographic assessment, 
that is, acoustic recordings of the patient’s speech and 
recordings of how she uses air from her lungs for speech.” 

A: “Based on your analysis of Ms. Alquist’s speech, 
would you say her speech was abnormal?” 





C: “Yes, | would.” 

A: “In what way, specifically?” 

C: “She had hypernasality and nasal emission.” 

A: “Would you please explain the meaning of these 
words for the court.” 

C: “Hypernasality refers to excess resonation of speech 
sounds within the nasal chambers because of partial or 
complete failure of the soft palate to close off the nasal 
from the oral cavities, or from incomplete closure of the 
hard palate.” 

A: “You mean like cleft palate?” 

C: “Yes. But she did not have a cleft palate.” 

A: “Doctor, please point out on this chart the parts of 
the speech mechanism you are talking about.” 

C: “This is a side view of the head and neck. This plate of 
bone separates the oral from the nasal cavity. It is called 
the hard palate. This flap of tissue is called the soft palate 
or velum. It can lift upward and backward to contact the 
back wall of the throat. When it does, it seals off the oral 
from the nasal cavity.” 

A: “And, what if it doesn’t?” 

C: “As | explained before, then speech will be abnor- 
mally resonated in the nasal cavities. | would add that con- 
sonants will be heard as weak or absent if the opening is 
very large. People would then have trouble understanding 
what the speaker was saying.” 

A: “We will return to what you found out about the 
plaintiff's speech mechanism, but at this point, | would 
like you to explain more fully the implications of Ms. 
Alquist’s speech defect.” 

C: “As | said, she was hypernasal and had nasal emission 
of the airstream as she spoke. If | may, | would like to play 
a videotape of the patient so that you can hear and see 
what |’m talking about. As you can hear from the tape, her 
speech has a noticeable and distracting nasal quality, and 
some of her words are difficult to understand. We call this 
reduced intelligibility. Also, you can see she has a ten- 
dency to narrow her nostrils and wrinkle her forehead on 
many consonant sounds. This is a kind of compensation 
people with palatal insufficiencies develop. It is an 
attempt to constrict the nostrils so that less air will pass 
out through them.” 

A: “How would this disorder interfere with the patient’s 
everyday life?” 

C: “A speech defect is a personal matter. Two people 
with defects of the same type and severity often react dif- 
ferently, depending upon their personality and the impor- 
tance of speech in their daily lives.” 

A: “And, how would you characterize the effect of Ms. 
Alquist’s defect on her?” 

C: “Based on the psychologic and social history portion 
of my examination, Ms. Alquist was devastated by her 
speech disorder.” 

A: “What is your evidence for that statement?” 

C: “She broke down several times during the examina- 
tion, especially during discussions of the effects of her 
disorder on her professional activities. She said her 
speech had always been important to her; that she used 
her speech to favorable advantage in the classroom, that 
she knew she had a good voice, would often dramatize 


segments of the curriculum through plays and story- 
telling, and that she taught singing. Outside of the class- 
room, she was active in teachers’ organizations and was 
known as a good public speaker and debater.” 

A: “Has this changed as a result of her disorder?” 

C: “It has, according to what she told me, turned her life 
around completely. Her students, she said, became inat- 
tentive. She began to notice derisive facial expressions of 
students and colleagues as she would speak. Some of the 
children made fun of her. Speaking tired her out—she had 
to exert much more effort to make herself understood.” 

A: “Has her speech disorder affected her in any other 
ways?” 

C: “According to Ms. Alquist, she was called into the prin- 
cipal’s office and told that she would have to take a leave of 
absence until something was done about her speech. That 
was a year ago. She has not worked since. She says she has 
become withdrawn, has a fear of using the telephone, and 
has been seeing a psychiatrist for depression.” 

A: “Have any attempts been made to correct, treat, or 
rehabilitate her speech?” 

C: “Not to my knowledge. No one recommended 
speech therapy in her locality.” 

A: “Doctor, let’s get back to the physical examination of 
the plaintiff. What, in your opinion, was the reason she 
developed this disorder at this time in her life? Why would 
anyone whose speech had been not only normal, but who 
was in fact a skilled speaker, suddenly acquire this kind of 
defect?” 

C: “I would feel more comfortable if you asked the plas- 
tic surgeon and otolaryngologist that question, although | 
think we agree, having discussed the case in private, on 
the cause of the disorder.” 

A: “Nonetheless, the court is interested in your opinion; 
as an expert witness in your field, you are justified in com- 
menting on the physical aspects of the case.” 

C: “Well, we have photographs of the patient’s mouth 
and x-ray motion pictures of her soft palate during 
speech. These lateral motion picture studies clearly show 
that, although the soft palate is able to lift up, it is too 
short to make closure contact against the back wall of the 
throat; it elevates, but there is a considerable gap left 
between it and the posterior pharyngeal wall. During our 
examination, although the hard and soft palate looked all 
right when the patient opened her mouth, when we 
explored her hard palate we felt a notch at the back where 
the soft and hard palate join. This notch is called a submu- 
cous cleft. We also noticed, and we almost missed this 
because it was so subtle, that the little pendulum of tissue 
that hangs down from the middle of the soft palate, the 
uvula, was almost invisibly split; you could separate it with 
a tongue depressor, but most of the time the two halves 
stuck together so that it looked like solid tissue. You can 
see the split on this photograph.” 

A: “Would you give us the implications of all this?” 

C: “We think Ms. Alquist has had a lifelong condition of 
submucous cleft palate, a cleft in the bone and muscular 
tissue that is hidden under the mucosal tissue. In her case, 
a speech disorder was not evident because she was able to 
attain adequate closure by contacting her soft palate 
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against her adenoids. In other words, the adenoids were 
serving as an anatomic structure on the back wall of the 
throat and were helping to effect complete closure; when 
they were removed, the space created was too great for 
her soft palate to bridge. The adenoid removal unmasked 
the submucous cleft.” 

A: “Is this condition remediable?” 

C: “A prosthetic device called a palatal lift would par- 
tially close the gap. More permanent would be a surgical 
procedure, a pharyngeal flap operation, in which a flap of 
tissue raised from the pharyngeal wall is sutured to the 
soft palate, or a sphincter pharyngoplasty.” 

A: “We want to thank you for your testimony. If we have 
any further questions, we'll arrange another appointment 
through your secretary.” 


As this testimony illustrates, although hypernasality and 
nasal emission are only aesthetically displeasing when mild, 
when excessive, they diminish or even destroy intelligibility, 
producing serious occupational and psychologic distur- 
bances. When hard palate or velopharyngeal port defects are 
left untreated in infancy or early childhood, they may impede 
speech and language production development and negatively 
influence the self-concept. Even when they occur for the first 
time in adulthood, as in the case just presented, they can pro- 
duce profound psychologic and occupational maladjust- 
ments. One might ask, “Which would be more devastating: 
aphonia (whispered speech) from laryngeal disease or severe 
hypernasality and nasal emission from velopharyngeal or 
palatal insufficiencies?” Our first impulse might be to answer 
that the aphonia would have more profound consequences. 
In actual fact, loss of normal velopharyngeal closure can be 
much more serious, because without adequate intraoral pres- 
sure, most consonants lose their identity. This fact is amply 
illustrated in isolated paralyses of the soft palate; respiration, 
phonation, and articulation are normal, yet, speech is unin- 
telligible solely on the basis of diminished resonance balance. 
When there is a leak of air into the nasal cavity, it is difficult 
to build up intraoral pressure and to maintain speech motor 
coordination necessary for intelligible speech. 


+ Nasal Resonatory Disorders 


Classification and Definition of Nasal 
Resonatory Disorders 


Nasal resonatory disorders can be classified into four main 
auditory-perceptual types: 


1. Hypernasality and nasal air emission 


a. Hypernasality Synonyms are rhinolalia aperta, hyper- 
rhinolalia, and open nasality. Defined as excess reso- 
nance of vowels and voiced consonants within the 
nasal cavities. The anatomic-physiologic basis is open 
coupling between the oral and nasal cavities due to 
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incomplete closure of the hard palate and/or velopha- 
ryngeal sphincter. 


b. Nasal air emission Defined as abnormal flow of air 
from the nares during the production of high-pressure 
consonants, it is measurable biophysically via airflow 
(aerodynamic) techniques. It is heard as friction noise 
accompanying or replacing the target consonant and 
may have a “gurgling” quality if there is coexisting nasal 
congestion. It is sometimes referred to as a nasal rustle. 


c. Nasal and/or facial grimacing Defined as occlusion of 
the nares by contracting the alae of the nose, with 
associated wrinkling of the forehead, during consonant 
production, a compensatory action in an attempt to 
restrict the nasal flow of air. 


2. Hyponasality Synonyms are denasality, rhinolalia clausa, 
and closed nasality. Defined as diminution or absence of 
normal nasal resonance of the nasal semivowels /m/, /n/, 
/y/ and loss of normal assimilation nasality. Its physical 
basis is over closure of the portal or obstruction between 
the oral and nasal cavities owing to space-occupying 
lesions within the nasopharynx or nasal cavities. Two 
subtypes are recognized: 


a. Rhinolalia clausa, posterior A type of closed nasality in 
which the nasal semivowels lose their normal resonance 
because of an obstruction in the posterior region of the 
nasal cavities or the nasopharynx. The nasal phonemes 
Im], /n/, and /n/ are heard as oral stops /b/, /d/, and /g/. 


b. Rhinolalia clausa, anterior A type of closed nasality in 
which all the vowels and nasals are produced with a 
hollow-sounding resonance owing to obstruction in 
the anterior region of the nasal cavities. It is also called 
cul-de-sac resonation. 


3. Mixed nasality A synonym is rhinolalia mixta. Defined as 
simultaneous velopharyngeal insufficiency and nasal 
obstruction resulting in a hollow resonation of all vowels 
and voiced consonants. 


4. Resonance imbalance Resonance imbalance is most often 
exhibited in persons with hyperfunctional voice disor- 
ders. Perceptually, they may sound neither hypr-o-nor 
hypernasal. Voice quality is most likely to sound throaty, 
tight, and thin. Stroboscopic recordings of the vocal folds 
often reveal a long, closed phase and considerable medial 
or anteroposterior (A-P) constrictions of the larynx. 


Etiology of Nasal Resonatory Disorders 


The etiologies of nasal resonatory disorders are outlined in 
Table 4.1. Their causes can be subdivided into organic and 
psychogenic. Organic causes of hypernasality and nasal 
emission are anatomic defects, such as clefts of the hard and 
soft palate, obstructive tonsils, nasal structural deviations; 
iatrogenic from surgical removal of some portion of the 
palate; and neurologic disease producing paresis or paralysis 
of the velopharyngeal musculature. Nonorganic causes of 
hypernasality and nasal emission are conversion reaction, 
immature personality development, poor motivation, and 


Table 4.1 Etiology of Nasal Resonatory Disorders 


Hypernasality and nasal emission 
Organic 
Anatomic 
Overt cleft palate with or without cleft lip 
Submucous cleft palate 
Congenitally short soft palate or large nasopharynx 
Traumatic structural damage 
Neurologic (dysarthria) 
Lower motor neuron (flaccid) 
Unilateral upper motor neuron (“flaccid”) 
Bilateral upper motor neuron (spastic) 
Mixed lower-upper motor neuron (flaccid-spastic) 
Hyperkinetic (dystonicchoreic) 
Psychogenic 
Conversion reaction 
Immature personality 
Poor motivation 
Imitative 
Hyponasality 
Organic 
Hypertrophied adenoids 
Tumors 
Inflammations 
Postsurgical repair 
Patulous eustachian tube 
Nasal deformity 
Mixed nasality 
Combinations of organic etiologies of hypernasality and 
hyponasality 


imitation. Hypernasality is also common in persons with 
severe, congenital sensorineural hearing loss owing to poor 
auditory feedback. Velopharyngeal incompetence is associ- 
ated with genetic conditions and identifiable syndromes 
(Shprintzen, 2000a, b). Thus, velopharyngeal incompetence 
may result from an isolated cause or may be multifactorial in 
origin resulting from learning, reduced sensory feedback, 
poor motor control, and/or morphologic deviations in the 
speech structures. 

Causes of hyponasality are primarily organic: space-occu- 
pying lesions, such as tumors, inflammations, nasal polyps, 
and hypertrophied adenoid tissue, and structural deformi- 
ties, such as deviated nasal septum. 

Mixed nasality is caused by coexisting velopharyngeal 
insufficiency and obstruction of the nasal passageways. 


¢ Anatomy and Physiology of the 
Velopharyngeal Mechanism 


Surface Anatomy 


The anatomy that pertains to normal and abnormal nasal res- 
onatory functions consists of the oral, nasal, and pharyngeal 
cavities bounded by muscles and their bony attachments. The 
x-ray in Fig. 4.1 illustrates the separation of the oral from the 
nasal chambers by the bony hard palate. Suspended from its 
posterior edge is the resting soft palate, or velum. 
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Figure 4.1 


Lateral 
pharyngeal wall 


Because a lateral view of the velopharyngeal port only 
affords a two-dimensional appreciation of this valve, vertical 
and anteroposterior views are needed, shown in Fig. 4.2. The 
port is bounded anteriorly by the velum and laterally and 
posteriorly by the lateral and posterior pharyngeal walls. Its 
elliptical-shaped opening hints at its pattern of closure, 
which is often sphincteric, similar to a drawstring or camera 
aperture. 

Another view of the hard and soft palate is through the 
open mouth, although this approach affords limited informa- 
tion in comparison with x-rays or nasal endoscopy. Clinicians 
using intraoral examination alone may make erroneous deci- 
sions about the competency of the structures because they 
are unable to see how adenoid pads or posterior pharyngeal 
wall and velum work together to achieve closure. Also, a 
wide-open mouth may artificially change the tension of the 
oral structures and subsequent movement patterns used to 
produce normal resonance. Figure 4.3 shows the soft palate 
with its two symmetric arches. Suspended from its midline is 


È -n the uvula. Running down from the arches on either side of the 


“ soft palate are two folds of tissue. The one closer to the front 


Lateral radiograph showing the hard and soft palate. of the mouth is the anterior pillar of the fauces and the one 






Posterior 
pharyngeal wall 





Figure 4.2 Three-dimensional view 


of a section through the velopharyn- 
Velum geal port. 
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Uvula 






Posterior pillar 
of the fauces 


Anterior pillar 
of the fauces 


Figure 4.3 The soft palate and surrounding structures as seen on per- 
oral examination. 


toward the rear is the posterior pillar of the fauces. Nestled 
between them are the palatine tonsils. 

The roof of the mouth is bordered by the upper dental 
arch and is covered by ridged mucosal tissue under which 
lies a thick periosteal tissue covering. The anterior two 
thirds of the roof consist of the hard palate and the posterior 
one third the soft palate. It is difficult to see where one ends 
and the other begins, but their junction is easy to feel by 
running the index finger along the midline of the hard 
palate from front to back until the finger sinks into soft tis- 
sue marking the border between the hard and soft palate. 
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Bones 


The velopharyngeal musculature is suspended from three 
bones: the hard palate, the sphenoid bone, and the temporal 
bone. 


Hard Palate 


The hard palate is formed by the palatine processes of the max- 
illae and the horizontal plates of the palatine bones (Fig. 4.4). It 
is covered by a dense periosteum surfaced by mucous mem- 
brane. When stripped to the bone, the hard palate reveals 
that it is divided by sutures. The palatomaxillary suture 
divides the more anterior, larger palatine process of the max- 
illa from the smaller, more posterior horizontal plate of the 
palatine bone. Its posterior margin serves as the attachment 
of the soft palate. At its midline is a sharp spicule of bone 
called the posterior nasal spine. 

The two halves of the palatine processes of the maxilla are 
joined in the midline at the intermaxillary suture. The two 
halves of the horizontal plates of the palatine bone are joined 
in the midline at the interpalatine suture. The opening in the 
anterior hard palate is the incisive fossa and in the lateral 
portions of the palatine bone the greater palatine foramina. 
Medial and posterior to these are the lesser palatine foram- 
ina. 


Sphenoid Bone 


The sphenoid bone lies at the base of the skull in front of the 
temporal bones and the basilar part of the occipital bone 
(Fig. 4.5). The lateral pterygoid plate and especially the 
medial pterygoid plate with its pterygoid hamulus are por- 
tions of the sphenoid bone directly concerned with the soft 
palate musculature. 


alatine process 
of maxilla 


f Interpalatine 
suture 





Figure 4.4 The hard palate; inferior view. 
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pterygoid plate 


Figure 4.5 The sphenoid bone and schematic 
drawing of levator and tensor veli palatini mus- 
cles; posterior view. 
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The temporal bones form the sides and base of the skull, the 
latter of which is a point of attachment for the soft palate 
musculature. 


Blood Supply 


The blood supply of the hard and soft palate comes from dif- 
ferent vascular vessels, and their vascular architectures have 
their own character (Zhong, Zhang, and Wang, 2001). The 
greater palatine artery provides the major supply to the hard 
palate and the ascending palatine artery is the principal 
source for the soft palate. The two anastomose at the junc- 
tion of the hard and soft palate putting it at risk during palate 
repair. The greater palatine artery emerges on the oral sur- 
face of the palate through the greater palatine foramina, runs 
forward in a groove near the alveolar border of the hard 
palate through the incisive canal, and, linking up with the 
sphenopalatine artery, is distributed to the gums, palatine 
glands, and mucous membrane of the roof of the mouth. The 
branches of the greater palatine artery form rich anastomoses 
with neighboring vessels. The artery branches are arranged in 
lines with a tree-like appearance. The capillary network is 
loosely packed. In contrast, the arteries in the soft palate are 
crooked, exhibit directional changes, and the capillary net- 
work is densely packed. 


Muscles 
Levator Veli Palatini 


This muscle arises from the base of the skull anterior to the 
carotid canal and descends anteromedially along the pos- 
teroinferior border of the eustachian tube. It inserts into 
the palatine aponeurosis of the middle third of the soft 


the opposite side and with the remaining muscles of the 
soft palate. Immediately prior to entering the velum, the 
levator divides into two parts. The smaller bundle of mus- 
cle fibers (anterolateral part) runs anteriorly, close to the 
lateral pharyngeal wall, and inserts into the palatine 
aponeurosis through several fine tendons creating a tendi- 
nous insertion. The main part of the muscle runs medially 
into the velum, where it fans out and forms the levator 
sling with the contralateral levator. Inadequate release of 
the tendinous insertions at the time of palate repair may 
tether the levator anteriorly and compromise muscle 
retropositioning or may split the levator rendering only 
part of it retropositioned. 


Tensor Veli Palatini 


This flat, triangular-shaped muscle originates from the base 
of the skull between the spine of the sphenoid bone, the root 
of the pterygoid process, and the eustachian tube. The mus- 
cle ends in a tendon that makes a right-angle turn around 
the pterygoid hamulus and then runs horizontally into the 
palatine aponeurosis along with the other muscles of the 
soft palate. Secondary insertions are present on the maxil- 
lary tuber and/or in the submucosal tissue near the 
palatoglossal arch. It appears to act as the dilator tubae (Abe, 
Murakami, Noguchi, et al., 2004). The length of the tensor 
veli palatine muscle attachment and its ratio to the length of 
the eustachian tube, particularly the cartilaginous portion, 
increases with age from infancy to adulthood, and decreases 
with age from young adulthood to later life and helps explain 
its functions relative to the eustachian tube (Suzuki, Sando, 
Balaban, et al., 2003). The tensor tympani and tensor veli 
palatine muscles make a functional unit (Kierner, Mayer, and 
Kirschhofer, 2002). 
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Palatoglossus 


The palatoglossus forms the anterior pillars of the fauces. It 
is a small, thin muscle that arises from the palatine aponeu- 
rosis and descends to insert into the lateral border of the 
tongue, infiltrating its transverse muscle fibers. 


Palatopharyngeus 


This muscle forms the posterior pillars of the fauces. It is 
larger than the palatoglossus muscle. It has three compo- 
nents of pharyngeal origins and two heads of velar insertions 
that provide the continuity between the velum and laryngeal 
pharyngeal wall. It originates from the back and side walls of 
the pharynx and from the thyroid cartilage and ascends to 
the posterior and lateral portions of the soft palate. Some of 
its fibers insert into the tip of the medial cartilaginous wall of 
the eustachian tube and are considered by some anatomists 
to compose a separate muscle, the salpingopharyngeus. 
However, the bulk of the fibers extend to the posterior ton- 
sillar pillars. This suggests that the palatopharyngeus is not 
only a major musculature of lateral pharyngeal wall motion 
but also plays an important role in velar function (Cheng and 
Zhang, 2004). 


Constrictor Pharyngis Superior 


The superior pharyngeal constrictor forms the lateral and 
posterior walls of the velopharyngeal port. It attaches to the 
pterygoid process, the pterygomandibular raphe, the 
mandible, and the tongue. Its most cranial portion, which 
attaches to the medial pterygoid plate and to the pterygoid 
hamulus, is most directly active in velopharyngeal closure. 


Musculus Uvulae 


Each bundle of the paired uvulae muscle originates lateral to 
the midline from the tendinous palatal aponeurosis posterior 
to the hard palate and just anterior to the insertion of the lev- 
ator veli palatine. The paired muscles converge in an area 
overlying the sling of the levator muscle and course along the 
dorsum of the soft palate terminating as two separate bun- 
dles that subdivide and insert between the mucous glands of 
the uvula proper into the connective tissue and basement 
membrane of the mucosa (Azzam and Kuehn, 1977). 


Velopharyngeal Biomechanics 


Velopharyngeal closure is produced by contracting the levator 
veli palatini, tensor veli palatini, and superior pharyngeal con- 
strictor muscles, actions shown in the schema of Fig. 4.5. The 
levator is the most important muscle of velopharyngeal clo- 
sure during speech. It elevates the midportion of the soft palate 
in a superior-posterior direction. The tensor does not con- 
tribute directly to velar elevation and retraction during speech, 
although upon being pulled taut it may raise the velum to the 
level of the hard palate. It is alleged that it also depresses the 
anterior velum opening the eustachian tube during swallow- 
ing. The superior pharyngeal constrictor narrows the port fur- 
ther by moving its walls medially and anteriorly. 


Contraction of the superior pharyngeal constrictor some- 
times produces a slight bulge or transverse fold in the poste- 
rior pharyngeal wall that is crescent-shaped or shelf-like in 
appearance. It is called Passavant’s ridge, first described in 
1869 by Passavant as a discrete bar or protuberance that 
forms on the posterior pharyngeal wall during speech only 
and consists of pharyngeal mucous and/or pharyngeal 
superficial muscles. Nasopharyngoscopic and multiview 
videofluoroscopic studies have revealed two types of for- 
ward movement of the posterior pharyngeal wall: one is 
movement of Passavant’s ridge and the other less sharply 
outlined is movement of the posterior pharyngeal wall 
created by contraction of a flexor muscle of the head 
(Yamawaki, 2003). Regardless of which type of forward 
movement, this prominence occurs in many normal speak- 
ers, in whom it may be of little importance, and in a much 
higher number of those who have velopharyngeal incompe- 
tence with and without cleft palate, in whom it probably 
contributes toward closure. 

Velopharyngeal opening is not completely passive and 
due only to muscular relaxation and gravity. It is not known 
for sure, but the soft palate may have an active antagonist, 
the palatoglossus, which pulls the soft palate downward 
and forward. This movement may be related, at least in part, 
to the palatoglossus’ role in respiratory activity and 
response to negative upper airway pressure. The negative 
pressure and depressed palate assists inward flow of air into 
the lungs. Gravitational forces on the palatoglossus differ 
little between the upright and supine position. However, the 
levator veli palatine has reduced peak activation levels in 
the supine body posture where gravitational effects work in 
the same direction toward closure (Moon and Canady, 
1995). 

Although contraction of the musculus uvulae shortens the 
uvula, the importance of its contribution to swallowing or 
speech is in question. On the one hand, many normal people 
are missing the uvula for congenital or surgical reasons with- 
out any untoward effects, but on the other hand, some indi- 
viduals who have had uvulectomy surgery to reduce snoring 
have developed secondary velopharyngeal incompetence. 
Because of its location and size, it appears that contraction of 
the musculus uvulae adds bulk to the dorsal surface of the 
elevated soft palate aiding in occlusion of the velopharyngeal 
port during both speech and deglutition for some individuals 
(Azzam and Kuehn, 1977). Thus, congenital deficiency of the 
bulk of this muscle may be responsible for minor velopha- 
ryngeal insufficiency. Its capacity for tension production and 
the anaerobic metabolic activity of the muscle uvulae may 
also contribute to sleep apnea (Series et al, 1995, 1996, 1999). 


Videofluoroscopic Studies 


Static x-ray tomograms or radiographs cannot do justice to 
the ceaseless undulations of the soft palate as it opens and 
closes the port during contextual speech that alternates 
between nasal and non-nasal phonemes. But such photo- 
graphs can allow for the beginning of understanding of the 
closure mechanism. In Fig. 4.6, an x-ray tomogram of a nor- 
mal person just beginning to produce the vowel /i/, the soft 
palate is beginning to rise. Below, the soft palate is elevated 
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and retracted against the posterior pharyngeal wall. A basal 
view of this action shows the sphincteric configuration of 
this closure. From their multiview radiographic classic stud- 
ies, Skolnick and colleagues (1969, 1970, 1973) have immea- 
surably enriched our understanding of velopharyngeal 
closure and concluded: 


We believe that concepts which view velopharyngeal clo- 
sure as resulting from a combination of velar (flap valve) 
and lateral pharyngeal wall movement are only partially 
correct. It appears that the mechanism is really a sphinc- 
teric one poorly appreciated because of the difficulty in 
visualizing the velopharyngeal portal in the manner neces- 
sary to show the sphincteric movements. 


Rather than conceptualizing soft palate and lateral and 
posterior pharyngeal wall movements as separate during 
closure, Skolnick showed radiographically that all move as 
a single, functional unit, and there is no precise demarca- 
tion between the lateral aspects of the velum and the 
beginnings of the pharyngeal wall. Although a lateral view 
is useful for demonstrating the vertical extent of the pha- 
ryngeal contribution to the sphincter, it is the basal view 
that permits a view of the port en face and gives us a true 
appreciation for the sphincteric concept of velopharyngeal 
closure. 

Since this time, a variety of visualization techniques have 
been used to further our understanding of this mechanism. 
We now know that there is variable contribution to 
velopharyngeal closure from the velum, the lateral pharyn- 
geal walls, and posterior pharyngeal wall from person to 
person and that these closure patterns change within indi- 


Figure 4.6 Videofluoroscopic frames showing lateral 
and base views of normal velopharyngeal closure. (From 
Skolnick, M.L. [1970]. Videofluoroscopic examination of 
the velopharyngeal portal during phonation in lateral 
and base projections. Cleft Palate | 7, 803-816.) 


viduals across the life span (Siegel-Sadewitz and Shprint- 
zen, 1986). 


Nasal Endoscopy 


Nasal endoscopy has become a routine clinical measure to 
determine sphincteric closure. Using a flexible fiberoptic 
bundle connected to a light source and camera, clinicians are 
able to observe and record velar function from a superior 
position during a variety of speech contexts. Many argue that 
you can get the same information with nasal endoscopy that 
you can obtain from videofluoroscopy without radiation 
exposure. Others suggest that videofluoroscopy is prefer- 
able. Havstam, Lohmander, Persson, Dotevall, Lith, and 
Lilja (2005) investigated how different visual assessment 
information influences the recommended treatment for 
velopharyngeal insufficiency (VPI). They compared recom- 
mendations made from videofluoroscopy, videofluoroscopy 
combined with nasal endoscopy, and nasal endoscopy alone. 
They reported that treatment recommendations were the 
same for 68% regardless of the type of visual information but 
that the percent agreement was higher with videofluo- 
roscopy in the lateral projection (84%) and when the lateral 
was combined with nasoendoscopy. They concluded that 
lateral projection videofluoroscopy should be the first step in 
visualizing VPI and nasoendoscopy should be a second step 
when additional information is needed. Others have sug- 
gested that nasoendoscopy should be the first step because it 
has the potential to determine not only if there is velopha- 
ryngeal incompetence but also the type of surgery needed. 
For example, a study by Armour, Fischback, Klaiman, and 
Fisher (2005) related speech outcomes to velopharyngeal 
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closure pattern and type of pharyngoplasty. They concluded 
that pharyngeal flap pharyngoplasty was more effective in 
treating velopharyngeal insufficiency in patients with circu- 
lar or sagittal velopharyngeal closure and less effective in 
treating coronal closure pattern. Coronal pattern velopha- 
ryngeal insufficiency was more appropriate for sphincter 
pharyngoplasty. Because these closure patterns are easily 
visualized with nasoendoscopy, it supports the notion of its 
routine use with this population. 

Two additional arguments for using nasoendoscopy in 
cases of velopharyngeal competence come from observa- 
tions of the velocardiofacial syndrome (also known as 
DiGeorge sequence and 22q11.2 deletion syndrome). This 
syndrome, or sequence, is now recognized as the most com- 
mon syndrome associated with cleft palate and VPI. It is 
associated with cleft palate (69%), heart defects (74%), and 
characteristic facial deformities and learning difficulties (70 
to 90%). There is phenotypic overlap with DiGeorge syn- 
drome and velocardiofacial syndrome. It occurs because 
chromosome 22, particularly band 22q11.2, is predisposed to 
rearrangements because of misalignments of low-copy 
repeats (LCRs). The size and size of the deletion in many 
velocardiofacial syndrome and DeGeorge syndrome patients 
do not differ (Luerssen, 2004). This syndrome also has been 
associated with microwebs and with medially positioned 
internal carotid arteries that may be associated with poste- 
rior pharyngeal pulsations and both of which can be seen on 
endoscopy. Thus, an important role of endoscopy in preoper- 
ative assessment of children for palatopharyngoplasty is 
observation of posterior pharyngeal wall pulsations that 
should alert clinicians to the diagnosis of 22q11.2 deletion. 
Knowledge of this deviation could be useful for preventing 
risk of damage to carotid arteries during velopharyngeal sur- 
gery. 

Similarly, Miyamoto et al (2004) have shown an associa- 
tion between anterior glottic webs and velocardiofacial syn- 
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drome (chromosome 22111.2 deletion). In 17 patients endo- 
scopically diagnosed with anterior glottic web, 65% were 
positive for chromosome 22q11.2 deletion, and of those 11 
patients only 5 showed subtle clinical manifestations of 
velocardiofacial syndrome, though with the aid of genetic 
testing all 11 were diagnosed with velocardiofacial syn- 
drome. 


Dynamic Magnetic Resonance Imaging 


Dynamic magnetic resonance imaging (MRI) recordings with 
near-real-time temporal resolution (real-time MRI) using a 
turbo spin-echo (TSE) sequence (TR = 170 milliseconds, 
TE = 21 milliseconds, slice thickness = 6 mm, six images per 
second) has also been used for analyzing velopharyngeal clo- 
sure during phonation. It compares favorably with multi- 
view videofluoroscopy. It appears to have the potential to 
depict the pattern of velopharyngeal closure in close correla- 
tion with videofluoroscopy and may deliver additional infor- 
mation in selected cases (Beer et al, 2004). 


Electromyographic Studies 


Another approach to the study of normal velopharyngeal 
closure is by electromyography (EMG). Studies do not fully 
agree on the extent to which each of the velopharyngeal 
muscles participates during closure (Bell-Berti, 1976; 
Benguerel, Hirose, Sawashima, and Ushijima, 1977; Cooper, 
1965; Li and Lundervold, 1958; Lubker, 1968; Lubker and 
Curtis, 1966; Seaver, 1979). This lack of agreement may be 
due to differences between subjects and the protocol used. A 
representative study of closure in normal subjects was done 
by Fritzell (1969), who placed EMG electrodes in the 
velopharyngeal musculature and measured their potentials 
during speech. A sample of recordings from different mus- 
cles is shown in Fig. 4.7. From his study he found: 


retentive dieters iit eo Hh th A 


Figure 4.7 Electromyographic recording of the levator veli 
palatini and palatopharyngeus muscles during speech. (From 
Fritzell, B. [1969]. The velopharyngeal muscles in speech. Acta 
Otolaryngol 250 [Suppl], 13.) 
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1. The levator veli palatini was continuously active during 
speech. 


a. Activity began before onset of sound. 

b. Activity ceased before termination of phonation. 

c. Degree of activity fluctuated during speech. 

d. Little or no activity occurred during nasal semivowels. 


2. The tensor veli palatini behaved with considerable inter- 
subject variability and in many cases with little or no 
activity. 


ww 


. Superior pharyngeal constrictor muscle activity was simi- 
lar to levator (1.e., active during all except the nasal sounds). 


4. The palatoglossus produced bursts of activity preceding 
nasal semivowels but was weak or inactive for oral 
sounds. However, there is conflicting opinion on the 
validity of this finding. 


5. The palatopharyngeus varied considerably among sub- 
jects; it always showed some activity but of low intensity. 


Fritzell drew the following conclusions from his study: 


— 


. The levator is the most important muscle of closure dur- 
ing speech. 


2. The tensor is nearly silent and has no typical relationship 
to speech. 


Ww 


. The superior constrictor has patterns similar to the levator. 


A 


. The palatoglossus actively pulls down the velum; lower- 
ing is not due only to gravity. 


a 


. The palatopharyngeus has little activity during speech. 


an 


. Ahigh positive correlation (0.76) exists between extent of 
EMG levator activity and extent of velar displacement. 


7. Velar movement begins within 40 milliseconds after 
onset of EMG levator activity and speech follows ~300 
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milliseconds later, varying widely, depending upon the 
initial phoneme and speaking situation. 


More recently, Velepic et al (1999) used EMG to investigate 
the velopharyngeal sphincter structures in 75 patients. They 
added an additional observation to those of Fritzell suggest- 
ing EMG was useful for diagnosing the exact causes of 
velopharyngeal insufficiency and for choosing the best 
approach for treatment. 


Nasal Airflow Studies 


The extent of nasal airflow can be measured by a variety of 
instruments. Most frequently, instrumental assessments 
use a pneumotachograph, which consists of a flowmeter 
and a differential pressure transducer. Airflow from the 
nose passes across a wire mesh screen. The resulting pres- 
sure drop is linearly related to the volume rate of airflow 
and is converted by a pressure transducer into an electrical 
voltage that is recorded and visualized on a computer mon- 
itor. 

Pressure flow studies summarized 40 years ago by Warren 
(1964) have yielded the following important data: 


+ Normal intraoral pressure for pressure consonants is 
between 3 and 7 cm H30. 


+ Normal consonant intraoral pressure cannot be main- 
tained if the velopharyngeal portal is open more than 
0.02 cm? (Fig. 4.8). 


Audible nasal emission and hypernasality occur when 
openings are between 0.1 and 0.2 cm?. 


+ Influencing the amount of nasal airflow are extent of oral 
airway opening, amount of nasal airway obstruction, and 
volume of the oral cavity. 


+ Normal speakers have imperceptibly small amounts of 
nasal airflow during production of non-nasal phonemes. 


Figure 4.8 Relationship between velopha- 
ryngeal orifice size and intraoral pressure dur- 
ing consonant productions. (From Warren, 
D.W. [1964]. Velopharyngeal orifice size and 
upper pharyngeal pressure-flow patterns in 
normal speech. Plast Reconstr Surg 33, 
148-162.) 
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+ Speakers vary in the duration of portal opening during 
production of nasal phonemes. 


Subsequent work by Warren and others has led clinicians 
to incorporate aerodynamic studies into their velopharyn- 
geal assessment clinical protocols. A combination of simulta- 
neously recorded nasal airflow and intraoral pressure during 
a variety of speech contexts appears to give the most valu- 
able information. Clinicians are able to calculate nasal resist- 
ance and to determine how phonetic contexts, increased 
orality, and change in volume and pitch impact velopharyn- 
geal valving. Several investigators have suggested that aero- 
dynamic measures change with age (Smith et al., 2003, 
2004). Pressure drops with age and nasal flow increases as 
do orifice areas. Variability in pressure decreases and vari- 
ability in nasal airflow increases. Smith, Patil, Guyetle et al., 
(2004) have shown that by age 14, children tend to exhibit 
pressures and flows similar to those of young adults. 


Acoustic Studies 


Hypernasality arises from coupling between the oral and 
nasal cavities. Acoustic studies of hypernasalized vowels 
produced by normal speakers show the following spectral 
distortions (Dickson, 1962; House and Stevens, 1956; Peter- 
son and Barney, 1952; Smith, 1951): 


reduced intensity of the first vowel formant 


+ one or more loci of antiresonance; that is, a sharp drop in 
the intensity of a portion of the spectrum 


extra resonances, contributed by the nasal cavities; that 
is, reinforced harmonics at frequencies at which energy is 
not normally expected 


+ a shift in the center frequencies of the formants 


Even patients with marginal velopharyngeal closure show 
changes in the acoustic signal (Zhu, Sun, and Wang, 1998); 
there are extra formants or strong nasal formants; spike and 
stop gap in stops and affricatives may be absent; there is 
increased noise in the lowest frequency; and the acoustic 
features differ for the same vowel produced in isolation 
compared with that produced in connected speech. 

Knowledge of the acoustic changes that occur with 
velopharyngeal incompetence has led to the development of 
several instruments designed to provide both quantitative 
and qualitative information on nasalance. One such com- 
monly used device is the nasometer. The Nasometer II (Key 
Elemetries, Lincoln Park, NJ) is a noninvasive device that pro- 
vides relative quantification of nasalance and visual feed- 
back of the magnitude. A sensing device is placed under the 
nose and above the upper lip. The amount of nasalance rela- 
tive to oral emission is expressed in percentage and imaged 
on a computer screen relative to a zero baseline. Data from 
this instrument have been reported for populations around 
the world. From these studies, several generalizations can be 
made about the clinical use of normative nasalance values 
(Tachimura, Mori, Hirata, et al., 2000; Van Lierde, Wuyts, 
De Bodt, et al., 2001; Whitehill, 2001): 


+ They should be based on speakers of the same language 
and geographic dialect. 


+ They are gender specific with females generally having 
less nasalance. 


+ They are context specific. 
They are age dependent. 


They are influenced by vocal intensity and orality. 


Perceptual Analysis of Speech 


Equally important as the instrumental assessment is the per- 
ceptual analysis of speech. For example, velopharyngeal gap 
size may be predicted from speech assessment alone. Nasal 
rustle suggests a small gap size and moderate to severe 
hypernasality a large opening (Kummer, Briggs, and Lee, 
2003). In the Kummer study, perceptual characteristics of 
speech correctly predicted gap size for 121 of 173 subjects 
(70%). The specific speech characteristics to listen for are 
explained elsewhere in this chapter. The point we want to 
underscore here is that from the first utterance produced by 
the patient, the clinician is making perceptual judgments of 
communication competence, observing and gathering valu- 
able information of both a quantitative and qualitative 
nature. Instrumental measures augment these observations 
but don’t replace them. 


+ Organic and Nonorganic Hypernasality 
and Nasal Air Emission 


Cleft Lip and Palate 


Clefts of the hard and soft palate are one of the most com- 
mon and serious of all causes of hypernasality and nasal 
emission. Because full coverage of this subject is not possible 
here, we especially recommend the selected readings at the 
end of this chapter. 


Definition and Classification 


Cleft lip and palate represent a failure of the lip and hard and 
soft palate segments to join during those stages of embryonic 
development when closure should have occurred. An infant 
with cleft palate at birth has a direct communication 
between the oral and nasal cavities. The air stream during 
speech passes, unchecked, into the nasal cavities and out of 
the nares. The speech effects are (1) hypernasality of vowels 
and voice consonants; (2) nasal emission of the air stream; 
and (3) weak pressure consonants due to loss of intraoral 
pressure. Communication may be further compromised by 
delayed articulatory and language development, hearing loss, 
and reactive psychologic and educational disturbances. In the 
not too distant past, this sequelae was a common occurrence 
in individuals with clefts. However, improved surgical pro- 
cedures that can be done at an earlier age have dramati- 
cally reduced the number and severity of communication 
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Table 4.2 American Cleft Palate Association Classification of Cleft Lip and Palate 


Clefts of Prepalate 
Cleft lip Unilateral 


Bilateral 
Median 
Prolabium 
Congenital scar 
Cleft of alveolar Unilateral 
Bilateral 


Median 


Cleft of prepalate 


Clefts of Palate 
Cleft of soft palate Extent 


Palatal shortness 
Submucous cleft 





Cleft hard palate Extent 
Vomer attachment 
Submucous cleft 
Cleft of soft and Clefts of prepalate 
hard palate and palate 


problems related to clefting. Despite these advances, how- 
ever, the majority of preschoolers with cleft palate continue to 
require direct speech therapy (Hardin-Jones and Jones, 2005). 

Several classifications of cleft lip and palate have been pro- 
posed. The most comprehensive is the American Cleft Palate 
Association's classification found in Table 4.2. 


Embryology 


The embryologic sequence of face and palate development is 
illustrated in Fig. 4.9 and is described as follows: 


1. Face 


a. Fourth week (early) Facial primordia begin to appear 
surrounding the primitive mouth or stomodeum. 


1) Frontonasal prominence. Forms the upper bound- 
ary of the stomodeum. Originates from prolifera- 
tion of mesenchyme ventral to the brain 


2) Maxillary prominences. Form the lateral bound- 
aries of the stomodeum. Originate from first 
branchial arch 








3) Mandibular prominences. Form the lower bound- 
ary of the stomodeum. Originate from the first 
branchial arch 


Right, left 

Extent in thirds 

Right, left 

Extent in thirds 

Extent in thirds 

Small, medium, large 

Right, left, median 

Extent in thirds 

Right, left 

Extent in thirds 

Right, left 

Extent in thirds 

Extent in thirds 

Submucous right, left, median 
Any combination of foregoing types 
Prepalate protrusion 
Prepalate rotation 

Prepalate arrest (median cleft) 








Posteroanterior in thirds 

Width (maximum in millimeters) 

None, slight, moderate, marked 

Extent in thirds 

Posteroanterior in thirds 

Width (maximum in millimeters) 

Right, left, absent 

Extent in thirds 

Any combination of clefts 
described under clefts of 
prepalate and clefts of palate 


(4) Mandibular prominences. Form the lower bound- 
ary of the stomodeum. Originate from the first 
branchial arch 


. Fourth week (late) 


1) Nasal placodes. Bilateral, oval-shaped thickenings 
develop from surface ectoderm in the lower part of 
each frontonasal prominence. 


2) Medial and lateral nasal prominences. Horseshoe- 
shaped prominences at margins of nasal placodes 
develop from mesenchyme. 


bo 
we 


Nasal pits. Depressions in the nasal placodes 


4) Maxillary prominences. Grow toward each other 
and toward medial nasal prominences 





5) Nasolacrimal groove. Cleft or furrow separating 
lateral nasal prominence from maxillary promi- 
nences 


c. Sixth and seventh weeks 


(1) Intermaxillary segment of upper jaw. Formed by 
merging of medial nasal prominences. This seg- 
ment gives rise to: 


(a) Philtrum: the middle portion of upper lip 
(b) Middle portion of upper jaw and gum 
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(2) 


3) 


4) 





5) 





Fourth week: late 


(c) Primary palate 


Maxillary prominences. Formed by lateral parts of 
upper lip, upper jaw, and secondary palate. Merge 
laterally with mandibular prominences reducing 
size of mouth 


Primitive lips and cheeks. Invaded by second 
branchial arch mesenchyme, giving rise to facial 
muscles 


Frontonasal prominence. Forms forehead and dor- 
sum and apex of nose. Sides of nose (alae) derive 
from lateral nasal prominences 


Final development of face 


(a) Nose changes from flat to more mature form 


Nasal placode 





Fourth week: late 


2. Palate 


Figure 4.9 (A-B) Embryonic and fetal development 
of the face and palate. 


MAYO 
©1988 


(b) Mandible more prominent 
(c) 
) 


(d) Eyes move medially 
( 


Forehead more prominent 


e) Ears rise 
The palate develops from two parts, the primary and 


secondary palate, between the fifth and the twelfth weeks. 
a. Fifth week (late) 


(1) 


Primary palate (median palatine process). From 
the innermost part of the intermaxillary segment 
of the upper jaw formed by merging of medial 
nasal prominences. Forms a wedge-shaped mass 
of mesoderm between maxillary prominences and 
developing upper jaw 
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Figure 4.9 (C-D) (Continued) Embryonic and fetal development 
of the face and palate. 


(2) Secondary palate. Formed by two horizontal pro- 
jections of mesoderm from the inner surfaces of 
the maxillary prominences called lateral palatine 
processes 


(a) Initially project downward on each side of 
tongue 


(b) Later, grow toward each other and fuse 
(c) Also fuse with primary palate and nasal septum 


(d) Fusion begins anteriorly during ninth week 
and is completed posteriorly by the twelfth 
week 


Cleft Lip 


Cleft lip is not a cause of hypernasality and nasal emission, 
but it is associated with clefts of the palate. They can range 
from a small notch in the vermillion border of the upper lip 
to a complete cleft into the floor of the nostril and through 
the upper alveolar ridge (Fig. 4.10): 


+ Unilateral cleft lip Unilateral cleft lip represents a failure 
of the maxillary prominence on the affected side to fuse 
with the medial nasal prominence. 


¢ Bilateral cleft lip Bilateral cleft lip results from failure of 
the maxillary prominences to fuse with the medial nasal 





prominence. The clefts on both sides may be similar or 
have different degrees of defect. 


+ Median cleft lip Median cleft of the upper lip is rare and 
represents a partial or complete failure of the medial nasal 
prominences to fuse and form the intermaxillary segment. 
Median cleft of the lower lip is also rare and represents a 
partial failure of the mandibular prominences to fuse. 


Cleft Palate 


Cleft palate represents failure of the lateral palatine processes 
to fuse with each other, with the nasal septum, and/or with 
the posterior margin of the median palatine process or pri- 
mary palate (Fig. 4.11). 

Palatal clefts may be of the uvula only, or they may 
extend through the soft and hard palates, and through the 
alveolar process and lip bilaterally. They may be overt or 
submucous: 


+ Clefts of the anterior (primary) palate These are clefts 
anterior to the incisive foramen. 


+ Clefts of the anterior and posterior palate These are clefts 
that involve both the primary and secondary palate. 


+ Clefts of the posterior or secondary palate These are clefts 
posterior to the incisive foramen. 
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Bilateral 


Incidence 


The worldwide incidence of cleft lip and palate is ~1 in 700 
and varies according to race, sex, geographic area, and source 
of statistical reports (Olasoji, Ukiri, and Yahaya, 2005). Ross 
and Johnston (Cooper, Harding, Krogman, Mazahari, and Mil- 
lard, 1979), after reviewing many surveys and selecting only 
those that they believed were adequately performed, 
showed a distinct gradient in the incidence of cleft lip and 
cleft palate among various racial groups: 


+ The lowest incidence was among black African Ameri- 
cans, ranging from 0.21 to 0.41 per 1000 live births. 


+ The highest was among Asians, with a range of 1.14 to 2.13 
per 1000 live births. 


In the intermediate group were American and Western 
European Caucasians, with an incidence between 0.77 
and 1.40 per 1000 births. 


Associated Anomalies 


Individuals with clefts show an increased incidence of asso- 
ciated malformations, the most common being cardiac 
problems, positional foot defects, extremity malformation, 
circulatory defects, and malformed ears, micrognathia, 
hypertelorism, exophthalmia, and microcephaly. Clefts of 


Bilateral R 3/3 L1/3 


Figure 4.10 Varieties of clefts of the upper lip. 


the palate are common in several syndromes, among which 
are the velocardiofacial syndrome, also known as DiGeorge 
sequence and 22q11.2 deletion syndrome (previously 
described), and Goldenhar, robin sequence, and Van der 
Woude syndromes. Because velopharyngeal incompetence is 
associated with genetic conditions and identifiable syn- 
dromes and because knowledge of the problems associated 
with the syndromes may impact management decisions, it is 
essential that the syndromes and comorbid conditions asso- 
ciated with clefting be identified. (For a comprehensive 
picture of syndromes associated with clefting and their 
influence on communication, see the Suggestions for Addi- 
tional Reading at the end of this chapter.) 


Etiology 


Both genetic and environmental factors have been impli- 
cated in the etiology of both cleft lip with or without cleft 
palate and isolated cleft palate. Although its multifactorial 
nature is puzzling, scientists have made significant break- 
throughs with respect to breaking the code related to the 
genetics of these conditions, in particular, characterization of 
the underlying gene defects associated with several impor- 
tant clefting syndromes (Cobourne, 2004). Cobourne suggests 
of particular significance is the identification of mutations in 
the interferon regulatory factor-6 (IRF6) gene as the cause of 
van der Woude syndrome and the poliovirus receptor 
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Bilateral lip Soft palate cleft 
and „with 
alveolar clefts bifid uvula 










Soft palate cleft 
with bifi uvula and 
short soft palate 





Soft palate cleft 
and submucous 
hard palate cleft 


Left Unilateral Bilateral complete cleft, 
Complete cleft, Lip and Palate 
Lip and Palate 


Figure 4.11 Varieties of clefts of the alveolar processes and hard and soft palates. 


related-1 (PVRL1) gene as being responsible for an autoso- 
mal recessive ectodermal dysplasia syndrome associated 
with clefting. Although no specific disease-causing gene 
mutations have been identified in nonsyndrome clefting, 
several candidate genes have been isolated through both 
linkage and association studies. Cobourne continues to say 
that although no specific disease-causing gene mutations 
have been identified in nonsyndromic clefting, several can- 
didate genes have been isolated through both linkage and 
associated studies. And whereas it seems clear that environ- 
mental factors also play a role, the interactions that occur 
between candidate genes and environmental factors during 
embryonic development remain unknown. 


Heredity Although heredity does not even account for 50% 
of occurrences of cleft lip and/or palate, it is one of the most 
important causes of cleft lip and palate to understand 
because of the role of the speech-language pathologist in 
counseling. Two identified genetic groups are (1) cleft lip 
alone and (2) cleft lip and cleft palate. Both are more com- 
mon in males. They have a positive family history in ~40% of 
cases. Cleft palate alone is more common in females. There is 
a positive family history in ~20% of cases (Edwards and Wat- 
son, 1980). Genetic counseling can identify high-risk fami- 
lies. 


The mechanism of inheritance is not always clear. Cleft lip 
and palate pedigrees tend to show a recessive mode, 
although dominance does occur. Cleft palate alone shows 
simple dominance. 

As yet, there is no way of predicting with accuracy 
whether a subsequent member of a family will have a cleft 
lip or cleft palate. The only exception to this is when a spe- 
cific syndrome is recognized that lends itself to simpler pat- 
terns of inheritance. However, statistical tables for risk rates 
are available: 

Risk rates for cleft lip (with or without cleft palate) (Fogh- 
Andersen, 1942) are as follows: 


¢ For siblings, one child affected, parents normal, 4%. 
¢ For siblings, one child affected, one parent affected, 14%. 
¢ For offspring, one parent affected, 2%. 


Risk-rate for cleft palate alone (Fogh-Andersen, 1942) is as 
follows: 


+ For siblings, one child affected, parents normal, 2%. 
¢ For siblings, one child affected, one parent affected, 17%. 


¢ For offspring, one parent affected, 7%. 
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Cleft palate (Fraser, 1970) rates are as follows: 


@ When one parent has a cleft palate, each child has roughly 
1 chance in 16 of being affected. 


+ Ifthe parents are unaffected but have a near relative with 
a cleft palate, the chances that these parents’ future chil- 
dren would be affected is ~1 in 15. 


+ If the parents of a cleft palate baby are unaffected and do 
not have an affected near relative, the chance that they 
will have another child with a cleft palate is ~1 in 50. 


+ Ifan affected parent has an affected child, then the risk for 
future children goes up to ~1 in 6. 


In cleft lip with or without cleft palate, if two offspring are 
affected, the risk goes up for subsequent offspring from 4 to 
9%. There is no change in risk with cleft palate alone. 


Environment Sixty percent of cleft lip cases alone, or cleft 
lip and cleft palate, and 80% of cleft palate cases alone appear 
to be solitary occurrences with no family histories of clefts. 
Although intrauterine causes, for example, anatomic and 
physiologic variations of the uterus, metabolic alterations, 
infections, insufficient folic acid, drugs, diet, smoking, and 
x-radiation are suspected, no compelling evidence for exoge- 
nous causes in these solitary cases has been found. Incom- 
plete family histories may account for some in this group, 
who really belong in the hereditary category. Others may be 
part of a syndrome that has not been identified. 


Speech and Language Defects 


No two patients with cleft palate, with or without cleft lip, 
have the same type and degree of communicative disorder. 
The end product of these palatal defects depends upon sev- 
eral factors: extent and location of the palatal defect; dental 
abnormalities; hearing loss; intelligence; cultural environ- 
ment; emotional stability; prior medical and surgical thera- 
pies. Speech and language defects that can occur include but 
are not limited to: 


+ hypernasality of vowels and voiced consonants 

nasal emission on pressure consonants 

weak pressure consonants 

+ nonstandard laryngeal, pharyngeal, and lingual sound 


substitutions for standard consonants (i.e., compensatory 
articulations) 


delayed articulatory and language development 
articulatory defects secondary to dental defects 
articulatory defects secondary to hearing loss 


dysphonia 


| A >% >% è% o 


nasal and facial grimacing associated with speaking 


Articulatory Defects Because of surgical advancements, 
articulatory defects are no longer inevitable in children 


with cleft lip and palate but still commonly occur. Rarely 
does cleft lip alone produce defects of articulation unless 
its repair produces an exceptionally short, tight upper lip 
that prevents the lower one from reaching it for the bila- 
bial sounds /p/, /b/, and /m/. Perceptually acceptable, 
although cosmetically distracting, are the child’s compen- 
satory attempts to approximate the upper incisors with 
the lower lip to make bilabial sounds, called labiodental- 
ization. 

The remaining articulatory defects in children with cleft 
lip and palate are from the following causes described under 
the next six subheads. 


Reduced Intraoral Pressure Even when articulatory place- 
ments are correct, reduced intraoral pressure secondary to 
nasal air escape will cause consonant distortion. The greater 
the nasal air escape, the less intraoral air available for pres- 
sure consonants. Somewhere along the intraoral pressure 
deficiency continuum, intelligibility begins to deteriorate, 
ending in unintelligibility. 

Minor velopharyngeal insufficiency producing intraoral 
pressure loss that results in mildly weak consonants pres- 
ents a different set of problems than do large openings with 
correspondingly poor consonant precision. The mild or bor- 
derline insufficiencies cause more decision-making prob- 
lems. Clinicians find themselves asking, “Is he hypernasal or 
isn’t he? Are the consonants weak or aren’t they? Yesterday 
she sounded hypernasal, today she doesn’t. She’s hypernasal 
only when she’s tired or talks fast. If she puts forth consider- 
able exhalatory pressure she is less hypernasal.” What these 
statements reveal is that the patient has mild and inconsis- 
tent velopharyngeal insufficiency. In some instances, the 
slightest drop in effort during exhalation or velopharyngeal 
closure elicits the audible effects of the borderline velopha- 
ryngeal status. In many instances, this type of problem is 
easily corrected with speech therapy. In other cases, the 
patient is working maximally to correct the valving problem 
and a loss in adenoid tissue and head growth may put her 
over the edge and require surgical management. Another 
possible contributing factor is muscle fatigue. Tachimura and 
Nohara (2004) have shown that the velopharyngeal port 
mechanism appears to fatigue faster in children with 
velopharyngeal incompetence. 

A secondary consequence of nasal emission is audible 
noises from air passing through the nares, distorting and 
masking consonants and distracting the listener. 


Compensatory Articulatory and Phonatory Substitutions 
Remarkably, children born with palatopharyngeal insuffi- 
ciencies seem intuitively to grasp early in life the pressure 
physiology of their own speech mechanisms. Without 
instruction, they discover they can produce nonstandard 
sounds with the larynx, pharynx, velum, and tongue that 
simulate the manner of production of standard consonants, 
which they cannot make. The correct place of production is, 
however, often sacrificed. Those who acquire velopharyngeal 
insufficiency later in life have much less tendency to develop 
compensatory errors. The drawings in Fig. 4.12 correspond 
with the following substitutions, derived from Trost’s (1981) 
radiographic studies. 





Glottal stop 
-implosive phase 


\ i x e f 
\ x RAA 
A+ N 
V LA 
a Glottal stop 
WA -explosive phase 
Ñ 4 
Glottal 





Posterior nasal 
fricative continuant 


Velar 
Cc fricative continuant 
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Pharyngeal stop 
-implosive phase 7 


is Pharyngeal stop 
O -explosive phase 


Pharyngeal 
B fricative continuant 


Mid-dorsum palatal 
stop-explosive phase 


Mid-dorsum palatal 
D stop-implosive phase 


Figure 4.12 (A-D) Compensatory, nonstandard articulatory substitutions in palatopharyngeal insufficiency. 
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1. The glottal stop An explosive-like grunt or glottal coup 
/?/, made by tightly adducting the vocal folds, building up 
infraglottal air pressure, and suddenly releasing it by 
abducting the vocal folds. Glottal stop substitutions are 
used in place of the standard stop consonants /p/, /b/, /t/, 
/d/, /k/, and /g/. The glottal stop is often produced simul- 
taneously with the correct articulatory placement for the 
sound (coarticulation). 


2. The glottal fricative A continuous friction noise made by 
approximating, but not fully adducting, the vocal folds. It 
is used as substitution for the standard fricative continu- 
ants /f/, /z/, /f/, and /3/. 


3. The pharyngeal stop A linguapharyngeal stop consonant 
substitution for /k/ and /g/. It can be simulated by first 
walking the tongue backward from the /k/ position down- 
ward against the posterior pharyngeal wall and then sud- 
denly releasing air that has been built up below the level 
of the tongue. 


4. The pharyngeal fricative A continuous friction noise substi- 
tuted in place of standard fricative-continuants, formed by 
retraction of the tongue against the back wall of the pharynx. 


5. Velar fricative A continuous friction noise substituted in 
place of standard fricative-continuants, formed by retrac- 
tion of the tongue against the posterior palate. 


6. The posterior nasal fricative A velopharyngeal fricative 
produced by airway constriction as the soft palate approx- 
imates the pharyngeal wall, but which does not close the 
port. The air is then nasally released. The movement is 


sometimes aided by posterior posturing of the tongue. The 
sound is heard as an audible friction noise associated with 
nasal air escape. Radiographically, this velar articulation is 
seen as a blurring of movement called velar flutter. The 
posterior nasal fricative is used as a substitution for /f/, /z/, 
/f/, and /3/. 


7. The mid-dorsum palatal stop A linguapalatal stop made 
by placing the tongue in the approximate position for the 
glide /j/. It is used as a replacement for /t/, d/, /k/, and /g/. 


The tendency for children with cleft palate to posteriorize 
lingual placements has been known for a long time. The 
tongue can exert an upward force on whatever soft palate 
tissue remains in an effort to seal off the port and, at the 
same time, impound air behind the tongue at this presum- 
ably more advantageous location. Compensatory coarticula- 
tions or atypical simultaneous articulatory maneuvers have 
also been observed; for example, a glottal stop substitution 
for /b/ while at the same time approximating and separating 
the lips even though the air is released at the glottal and not 
at the bilabial level (Trost, 1981). 

Bzoch (1979) made a useful comparison between the 
articulation errors of 120 children aged 3 to 6 without clefts 
who were considered “normal” in their articulatory develop- 
ment for their age range and 60 matched children with cleft 
palate. Table 4.3, which contains normative information on 
mastery of articulatory skills reported in now classic studies 
by Templin (1957) and Poole (1934), shows the differences 
between the normal and cleft palate children. Note the ram- 
pant glottal stop and laryngeal and pharyngeal fricative sub- 


Table 4.3 Comparison of Articulatory Errors Most Often Found in Preschool Children with or without Cleft Palate 


Speech Sound 


Element Tested Templin Norm Pool Norm 
lpl 3 3.5 
[b/ 4 3.5 
i 6 4.5 
ld] 4 4.5 
Ikl 4 4.5 
lgl 4 4.5 
Ifl 3 5.5 
MI 6 6.5 
lol 6 7.5 
[6] 7 6.5 
Isl 4.5 7.5 
Izl 7 7.5 
{JI 4.5 6.5 
[tl] 4.5 = 
[d3/ 7 - 
I| 6 6.5 
il 35 4.5 
ll 4 75 
Iw] 3 3.5 
Imi 3 3.5 
Inf 3 4.5 
hl 3 4.5 


Errors of Cleft Palate 
Subjects in Order of 
Frequency of Occurrence 
(N = 60, 3 to 6 Years) 


Errors of 
Control Subjects 
(N = 120, 3 to 6 Years) 


None ?/p, PF 

p/b m/b, 2/b 
dit 2/t, PF, kft 
t/d ?|d, PF, n/d 
t/k 2/k, PF, t/k 
k/g, d/g 2/g, d/g, PF 
p/f, b/f, s/f 2/f, PF, p/f 
b/v, flv b/v, ?/v, m/v 
t/0, 5/6, f/6 2/6, PF, t/@ 
dð d/ð,?/ð, PF 
6/s, t/s PF, 2/s, t/s 
s/z, 6/z, d/z PF, 2/z, s/z 
s/J, tJ] PF, 2// 

Jit, stl, t/t] PF, 2/t/, [ftf 
t]/d3, d/d3, 3/d3 PF, 2/d3, 3/d3 
wji w/l, 2/| 

?/j 2/j, wl, Ij 
wjr w/r, jr, PF 
jiw ?/w, Iw 
b/m PF, 2/m 
m/n 2/n, jin 

n/y n/y 


Source: From Bzoch, K.R. (1979). Communicative disorders related to cleft lip and palate. Boston: Little Brown, p. 166. 


PF, either pharyngeal or velar fricative substitution. 
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stitutions in the cleft palate population. Because of improved 
surgery, there are fewer children with clefts exhibiting the 
articulatory errors recorded over 50 years ago, but when the 
errors are present they are similar to those described. 


Dental Abnormalities Most people with cleft palate have a 
defect of the dental and maxillary-mandibular relationship; 
missing teeth; malposed teeth; malocclusions; narrow, 
highly arched palates; scarred palates. Articulatory distor- 
tions arise from aberrant friction noise, or absence of friction 
noise, on sound requiring that the air stream be deflected off 
of the anterior teeth. The anterior lingual consonants most 
affected are /t/, /d/, /s/, /z/, /f/, 131, /t/, and /d/. Interdentaliza- 
tion of these sound is also common when the maxilla is 
abnormally small compared with the mandible, inducing a 
forward tongue carriage. 


Conductive Hearing Loss Serous otitis media producing con- 
ductive hearing loss is practically universal in the cleft palate 
population because of defective aeration and equilibration 
with atmospheric pressure of the middle ear. The cause is 
failure of the eustachian tube to open during swallowing due 
to defective velar musculature. If hearing loss is severe, it may 
interfere with expressive language and articulatory develop- 
ment or, if mild, can produce phonemic distortions. 


Delayed Speech and Language Development When research 
findings on articulatory and language development in chil- 
dren with cleft palate are collated, they provide convincing 
evidence that, evidence that, as a group, these children are 
delayed compared to typically developing children. Differ- 
ences in tests used and research methodology, failures to 
recognize children with syndromes, and breadth of semantic 
assessments require that the following summary statements 
be interpreted cautiously, 


+ Articulatory development is slower (Bzoch, 1956; Couni- 
han, 1960). 


+ First words are later (Bzoch, 1956). 


Sentences are-shorter and less complex (Spriestrsbach, 
Darley, and Morris, 1958). 


# Language comprehension and usage is delayed (Philips 
and Harrison, 1969). 


Expressive language development is delayed (Smith and 
MeWilliams, 1968). 


+ Children with compensatory articulation disorders have 
abnormal language development that is greater in situa- 
tional context compared to semantic or discourse (Pam- 
plona, Ysunza, Gonzalez, Ramirez, Patino (2000). 


Studies by Bzoch, Morley, Fex, et al. (1973) pointed to an 
even more definitive picture of expressive language delay. 
In 23 of 25 infants, mean age 18 months, they found an 
almost consistent and significant delay in expressive lan- 
guage but, interestingly, not in receptive, despite histories 
of widespread conductive hearing loss. Swanson (1973) 
reported similar findings in 37 infants as did Morris and 
Ozanne (2003) in 10 or 20 children with clefts. Morris and 
Ozanne selected children who had been identified at 2 
years of age as either at age level or significantly delayed in 


expressive abilities and repeated the tests at 3 years. The 
group with delayed early expressive language abilities at 2 
years continued to have expressive language problems at 3 
years of age and had poorer speech development compared 
with the nondelayed group. They concluded that there is a 
cleft palate subgroup that exhibits delays either because of 
structural/anatomic deficits, cognitive/linguistic delay, or a 
language/phonological disorder. Snyder and Scherer (2004) 
suggest that assessment of early play gestures may assist 
clinicians in identifying children with clefts who are at risk 
for later language impairment so that early intervention 
can be initiated. 


Voice Disorders It has been established that both children 
and adults with cleft palate and associated hypernasality 
and nasal emission have disorders of phonation often associ- 
ated with hyperplasia and hyperemia of the vocal folds 
(Brooks and Shelton, 1963; McDonald and Baker, 1951). 

Breathy, husky voice quality commonly observed in chil- 
dren who have cleft palate has been attributed to an attempt 
to compensate for the loss of intraoral pressure (Bzoch, 1964) 
or to compensate for an undetected laryngeal web (Luerssen, 
Proggmayer,Ptock, 2004) by overdriving the vocal folds to 
overcome reduced intelligibility of speech. McWilliams et al. 
(1969, 1973) found that 27 of 32 children with cleft palate 
were chronically hoarse and later developed bilateral vocal 
fold nodules. Again, compensatory laryngeal valving, over- 
driving the vocal folds, was blamed for these vocal abuse dis- 
orders. Bronsted, Liisberg, Orsted, et al. (1984) showed that 
more than half of the patients with cleft palate and velopha- 
ryngeal insufficiency were dysphonic. The incidence of struc- 
tural changes of the vocal folds from hyperemia to vocal 
nodules is not difficult to understand in light of the fact that 
the use of glottal stops as compensatory plosives produces 
pressure, friction, and blunt trauma to the vocal folds and to 
the fact that those children with velocardiofacial syndrome, 
or 22q11 deletion often have laryngeal webs. 

Leder and Lerman (1985) hypothesized that adults, like 
children, with cleft palate and hypernasality also hyper- 
adduct their vocal folds to produce a constriction below the 
velopharyngeal portal in an attempt to decrease nasal air 
leak. Based on acoustic spectrographic analyses, the authors 
concluded that the vocal folds were inappropriately hyper- 
adducted to provide a constriction in place of the inade- 
quately functioning velopharyngeal portal in an effort to 
eliminate nasal air leak, a causal factor for increased vocal 
nodules in the cleft palate population. 


Congenitally Short Palate and Large Nasopharynx 


Velopharyngeal insufficiency can exist without overt clefts 
of the hard or soft palate. The suspected causes are congeni- 
tal short soft palate, large nasopharynx, or both, but in many 
cases neither can be anatomically established and the 
velopharyngeal insufficiency must be considered idiopathic. 
These conditions often remain undetected until adenoidec- 
tomy deprives the patient of tissue mass in the nasopharynx 
and veloadenoidal closure (Gibb, 1958; Green, 1957; Maryn, 
Van Lierde, De Bodt, and Van Cauwenberge, 2004; Neiman 
and Simpson, 1975; Ren, Isberg, and Henningsson, 1995; 
Finkelstein et al., 1996, 2002). A functionally short soft palate 
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Figure 4.13 Video fluoroscopic frames showing congenital velopha- 
ryngeal insufficiency in a 6-year-old child at rest in lateral (a) and base 
(b) views; during nasal speech in lateral (c) and base (d) views; and dur- 
ing non nasal speech in lateral (e) and base (f) views. (From Skolnick, 
M.L. [1970]. Cleft Palate | 7, 803-816.) 


will often look normal on routine oral examination and may 
elevate well, but it does not contact the posterior pharyngeal 
wall (Fig. 4.13). According to Ren, Isberg, and Henningsson 
(1995), enlarged tonsils may also cause hypernasality after 
adenoidectomy. The enlarged tonsils restrict palatal eleva- 
tion and in so doing result in abnormal coupling of the nasal 
cavity. 

Two anatomic defects sometimes associated with a func- 
tionally incompetent soft palate are submucous cleft and 
bifid uvula (Fig. 4.14). A submucous cleft can vary from a 
slight notch into the posterior border of the hard palate to a 
large, U-shaped absence of bone. Bifid uvula, partial or total, 
may be a clue to occult embryologic palatal defects (Fig. 
4.15). The presence of a submucous cleft should raise the 
question of whether or not a genetic examination is in order. 
If it co-occurs with middle ear and cardiac defects, it may be 
indicative of velocardiofacial syndrome (Luerssen, Prog- 
gmayer, and Ptok, 2004). 


Injuries 


Hypernasality and nasal emission can result from traumatic 
damage to the palatal or velopharyngeal anatomy: 


Figure 4.14 Congenital submucous cleft. (From Luchsinger, K., 
Arnold, G.E. [1965]. Voice speech language. Belmont, CA: Wadsworth 
Publishing.) 


surgical removal of portions of the hard and soft palate for 
benign and malignant tumors 


+ damage to the velopharyngeal musculature during tonsil- 
lectomy and adenoidectomy, producing scarring and stiff- 
ness of the muscles and restricting full sphincter motion 


sequelae to surgical removal of adenoids, unmasking sub- 
mucous occult velopharyngeal insufficiency 


traumatic accidents, such as falling on hard or sharp 
objects in the mouth and other lacerations or penetrating 
wounds of the palate 





Figure 4.15 Bifid uvula and submucous cleft. (From Massengill, R. 
[1972]. Hypernasality. Springfield, IL: Charles C. Thomas.) 
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Neurologic Disease 


Neurologic disease can cause hypernasality and nasal emis- 
sion by producing weakness (paresis or paralysis) or incoor- 
dination of the velopharyngeal muscles. A concept already 
stated about neurologic voice disorders is also true of nasal 
resonatory disorders: Hypernasality and nasal emission due 
to neurologic velopharyngeal insufficiency are components 
and signs of a more generalized dysarthria, and, although 
such resonatory disorders can occur in isolation, they are 
most often embedded in a complex of associated respiratory, 
phonatory, and articulatory dysarthric signs. Hypernasality 
and nasal air emission can be among the first indicators of 
neurologic disease. The following types of dysarthria com- 
monly manifest hypernasality and nasal emission from 
velopharyngeal weakness or incoordination: 


è Unilateral or bilateral lower motor neuron (flaccid) 
dysarthria 


+ Unilateral upper motor neuron (“flaccid”) dysarthria 
+ Bilateral upper motor neuron (spastic) dysarthria 


+ Mixed lower-upper motor neuron (flaccid-spastic) 
dysarthria 


+ Hyperkinetic (choreic-dystonic) dysarthria 


In Chapter 5, the velopharyngeal insufficiency component 
of the dysarthrias is discussed. Unlike phonatory and articu- 
latory dysarthric signs, which have individual characteristics 
depending upon site of lesion, thereby giving a clue to loca- 
tion of the lesion, there is nothing perceptually distinctive 
about the hypernasality and nasal emission of one dysarthria 
compared with another. An exception may relate to con- 
stancy or intermittency of hypernasality, depending upon 
whether the dysarthria is of the continuous or fluctuating 
type, a distinction that will be made shortly. It is, however, 
important for the diagnostician to know which dysarthric 
types produce hypernasality and which do not. For example, 
a patient suspected of Parkinson’s disease who has hyper- 
nasality and nasal emission in addition to a hypokinetic 
dysarthria should be suspected of harboring a lesion in 
another area of the nervous system in addition to the basal 
ganglia, because pure hypokinetic dysarthria does not 
include velopharyngeal insufficiency. 


Flaccid Hypernasality and Nasal Emission 


Tenth (Vagus) Nerve Lesions The muscles of velopharyn- 
geal closure for speech are innervated by cranial nerves V, IX, 
X, and XI. Specifically: 


1. The levator veli palatini by the pharyngeal branch of the 
tenth (vagus) nerve 


2. The tensor veli palatini, which contribution to closure is 
uncertain, by the motor division of the fifth (trigeminal) 
nerve 


3. The pharyngeal muscles by the ninth, tenth, and eleventh 
cranial nerves (pharyngeal plexus) 


In Chapter 5, Fig. 5.3 and Table 5.1 show that damage to 
the pharyngeal branch of cranial nerve X, or above that level, 
up to the nucleus ambiguus in the medulla, produces unilat- 
eral or bilateral flaccid paralysis of the soft palate. Figure 5.3 
shows how a right unilateral lesion of the vagal nucleus pro- 
duces weakness of the soft palate on the same side of the 
lesion. Key points concerning this locus of lesion are: 


+ The paralyzed right side rests lower in the mouth than the 
normal side. 


¢ On phonation or upon stimulation of the gag reflex, the 
soft palate pulls upward toward the normal side. 


The same figure shows the effects of a bilateral lesion of 
the vagal nucleus: 


+ Both sides of the soft palate rest low in the mouth. 


+ On phonation or stimulation of the gag reflex, both sides 
move sluggishly or not at all. 


Absence or reduction of a gag reflex, also called a hypoac- 
tive gag reflex, is exclusively a sign of flaccid lower motor 
neuron paralysis. However, it is, infrequently, also a psy- 
chogenic sign. 

Neurologic diseases that commonly damage the nucleus 
ambiguus or the tenth nerves as they exit the base of the skull 
are tumors, vascular diseases, cerebrovascular accident, 
infectious diseases, bony malformations of the base of the 
skull, and degenerative diseases. Myasthenia gravis, when of 
the bulbar type, can affect the pharyngeal branch of the vagus 
nerve at the myoneural junction (Chapter 5). Hypernasality 
and nasal emission are prominent signs of this disease. 


Unilateral Upper Motor Neuron Lesions A unilateral cor- 
tical or subcortical lesion of the pyramidal tract (upper 
motor neuron) often produces unilateral paralysis of the soft 
palate on the side opposite the lesion. The classic case is the 
patient who sustains a left frontal cerebral hemisphere 
lesion and becomes hemiparetic, dysarthric, and apraxic. 
These patients typically have a unilateral upper motor neu- 
ron dysarthria that consists of mild articulatory imprecision 
from unilateral facial and tongue weakness on the side oppo- 
site the lesion. The effects of the unilateral soft palate paresis 
may not be heard because of its mildness, but it can be seen 
on oral examination. Evidently the tongue, lower face, and 
soft palate are not as well endowed with ipsilateral corticob- 
ulbar fibers as the larynx, which has no such unilateral pare- 
sis from unilateral corticobulbar tract lesions. 

The asymmetric appearance of a unilateral upper motor 
neuron paresis of the soft palate at rest is indistinguishable 
from a unilateral lower motor neuron paresis; in both, 
phonation and stimulation of the gag reflex cause the soft 
palate to pull up toward the normal side. 


Spastic (Pseudobulbar) Hypernasality and Nasal Emission 


Bilateral Upper Motor Neuron Lesions Typically, hyper- 
nasality and nasal emission are components of spastic 
dysarthria. However, the muscular physiology is not one of 


62 Clinical Voice Disorders 





paralysis, as in flaccid dysarthria, but slowness of velopha- 
ryngeal movement due to disinhibition of antagonist mus- 
cles producing dyssynchronous movements and incomplete 
closure. Attesting to the intactness of its innervation is either 
a normal or hyperactive gag reflex; upon stroking the faucial 
arches or back wall of the pharynx with a tongue depressor, 
the gag is easily elicited; the soft palate thrusts upward bilat- 
erally, and the pharyngeal walls briskly move medially. 


Mixed Flaccid-Spastic Hypernasality and Nasal Emission 


Lower Plus Upper Motor Neuron Lesions The most 
severe cases of neurologic-based hypernasality and nasal 
emission are found in patients with mixed flaccid-spastic 
dysarthria owing to their cumulative effects. The classic 
example is amyotrophic lateral sclerosis (ALS). However, 
any lesion that can affect both upper and lower motor neu- 
rons simultaneously can produce this same form of 
velopharyngeal insufficiency. In ALS, the most common dis- 
ease seen in clinical practice producing this mixed 
dysarthria, hypernasality and nasal emission are mild in 
the early stages. As the dysarthria progresses, hypernasality 
and nasal emission worsen until there is virtually no move- 
ment of the velopharyngeal muscles. The soft palate hangs 
low in the mouth, immersed in ropey saliva, and remains 
immobile during phonation or stimulation of the gag reflex. 
The paralysis is not always symmetric; even though both 
sides of the soft palate can become paralyzed, one side may 
be more than the other. 


Hyperkinetic (ChoreicDystonic) Hypernasality and Nasal 
Emission 


Basal Ganglia Lesions Fluctuating during the course of 
contextual speech are the hypernasality and nasal emis- 
sion of the choreic and dystonic movement disorder 
dysarthrias. The dysarthrias of dystonia, choreoathetosis, 
and chorea all have periodic velopharyngeal insufficiency, 
whether the movement disorder is the congenital or early 
acquired “cerebral palsy” or of later onset. Common to all 
are spontaneous, uncontrolled velopharyngeal move- 
ments, more prominent in the dystonias than the choreas 
because of the slowness of the dystonias and longer dura- 
tion of velopharyngeal opening. 


Palatopharyngolaryngeal Myoclonus 


Discussion of the slow (1 to 4 Hz), rhythmic, myoclonic-like 
movements of the soft palate and lateral pharyngeal walls in 
palatopharyngolaryngeal myoclonus is somewhat paradoxi- 
cal here, because these movements rarely produce hyper- 
nasality or hyponasality even though each time the soft 
palate elevates myoclonically, it touches the posterior pha- 
ryngeal wall, and, when it pulls away, it opens the velopha- 
ryngeal port. The moment of closure is so brief that its 
perceptual effects escape the ear. For clinical purposes, then, 
palatal myoclonus is a visual not an auditory sign. But, its 
importance in neurologic diagnosis cannot be overstated. It 
is easily overlooked because of its subtle movements, yet its 
presence indicates a brain-stem lesion. 


Psychogenic Factors 


The incidence of nonorganic hypernasality and nasal emis- 
sion in clinical practice is low, but a group of patients com- 
prise what is sometimes called “functional” hypernasality. 
These are children and adults who sound hypernasal but 
who have normal palatal structure and function as demon- 
strated on the oral examination and videofluoroscopy. The 
etiologies of nonorganic “velopharyngeal insufficiencies” are: 


+ conversion reaction 


reduced effort to produce normally vigorous speech 
owing to poor self-image, sometimes combined with bor- 
derline organic velopharyngeal insufficiency 


reduced speaking effort as a sign of “path of least resist- 
ance” in physically ill, debilitated patients 


imitation of relatives or friends with whom the person 
strongly identifies 


¢ Organic Hyponasality 


Hyponasality, or denasality, is the opposite of hypernasality. It 
is the reduction or absence of nasal resonation on the normal 
nasal semivowels and during normal assimilation nasality. It 
is due to velopharyngeal overclosure or to obstruction. In 
common everyday terms, it is called adenoid speech, or speak- 
ing with a “stuffy nose.” Curiously, many professional people 
in medicine and allied health fields confuse hypernasality and 
hyponasality; when they hear hyponasality, sometimes they 
not only use the term hypernasality or nasal speech instead, 
but they also infer velopharyngeal insufficiency instead of 
nasal or nasopharyngeal obstruction. It is an easy mistake to 
make by the untrained listener, because both types are so 
closely associated with the nasal cavities and perceptual 
changes in resistance. Official terminology for disorders of 
nasal resonance from nasal cavity obstruction follows: 


+ Rhinolalia clausa (closed nasality) 


+ Rhinolalia clausa, posterior A perceptually distinctive 
type of hyponasality from obstruction of the posterior 
nasal cavities or nasopharynx. 


+ Rhinolalia clausa, anterior A perceptually distinctive 
type of hyponasality from obstruction of the anterior 
nasal cavities. 


The normal speaker can simulate both types. Posterior: 
Say a sentence having nasal semivowels with the velophar- 
ynx continuously closed. For example, “Many men marched 
and sang at night.” Note that the only phonemes adversely 
affected are the nasals /m/, /n/, and / y /, which shift toward 
/b/, /d/, and /g/. Patients who have hypertrophied adenoids, 
tumors, or polyps in the nasopharynx are, in effect, speaking 
as if in a state of continuous velopharyngeal closure. Ante- 
rior: Say the same sentence with normal velopharyngeal 
action but while pinching the nares closed, producing the 
effect of an anterior nasal cavity obstruction. Now a hollow, 
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muffled resonation on the nasal semivowels is heard for the 
reason that, although the nasal cavities resonate the sound, 
the far end is closed, a condition called cul-de-sac res- 
onation. 

A secondary effect of nasal obstruction is mouth breath- 
ing. If nasal airflow during inhalation is reduced, the only 
way of maintaining adequate oxygen intake is orally. The 
child or adult whose mouth is always open and who has 
hyponasal speech is likely to have nasal obstruction and 
ought to alert the clinician to the need for investigation of its 
cause. Nasal obstructions are most often due to inflamma- 
tions of the nasal turbinates from upper respiratory infec- 
tions or allergies, benign and malignant tumors, enlarged 
tonsils that can push the velum against the pharyngeal walls, 
deviated nasal septums, and surgical overclosure of the 
velopharyngeal port secondary to pharyngoplasty for 
velopharyngeal insufficiency. Because such operations, 
exemplified by the pharyngeal flap, cannot be “fine tuned,” 
often the flap is made too wide for the nasopharynx. The 
patient enters surgery hypernasal and emerges hyponasal. 
Often, this is just temporary from edema that disappears, 
leaving an adequate nasal airway, but sometimes it is per- 
sistent because the flap is too wide. If this obstruction, like 
any obstruction not modified, is complete or nearly so, the 
following complications may occur: 


habitual mouth breathing 
snoring 


+ 
+ 
reduced sense of smell with loss of appreciation for food 
¢ halitosis from susceptibility to microorganisms in the 
nasal cavities and drying of the oral mucosa 


+ drying of gingival tissues around the anterior teeth, 
which, according to some dentists, can lead to periodon- 
tal disease 


+ danger of choking on food 


+ elongation of the bony structures of the face resulting in 
adenoidal faces 


The only recourse is to modify the obstruction surgically. 

A pharyngoplasty procedure when a child or adult 
becomes hypernasal after tonsillectomy and adenoidectomy 
(T and A) can be premature. Many people with normal 
velopharyngeal function who become hypernasal, some- 
times for a few months, after T and A, if left alone, will spon- 
taneously adapt to loss of adenoid tissue and will recover 
normal velopharyngeal closure. Others respond quickly to 
behavioral modification techniques. Thinking that the 
hypernasality is a sign of permanent organic velopharyngeal 
insufficiency, surgery is sometimes performed, and then 
complete function of the musculature returns, with the 
effect of overclosure and consequent hyponasal speech and 
secondary obstructive effects. 


Patulous Eustachian Tube (PET) 


A most unusual and little-discussed cause of hyponasality is 
a syndrome called patulous eustachian tube (PET). It is wor- 


thy of discussion because it is easily missed and misdiag- 
nosed. Its symptoms are: 


e aroaring sound in the ears synchronous with respiration, 
known as autophony 


+ a feeling of fullness in the ears 


+ irritability, depression, anxiety, and preoccupation with 
the symptoms 


+ a change in speech, as if the person were talking with a 
cold in the nose (i.e., hyponasality) 


+ disappearance of symptoms when lowering the head 
between the knees or when lying down 


The reason for patency of the eustachian tube is the loss of 
tissue mass surrounding its orifice or from a change in 
velopharyngeal muscular tonicity. 

Although the etiology is unclear, loss of weight, dehydra- 
tion, pregnancy, radiation therapy to the nasopharynx, 
fatigue, excessive estrogen, otitis media, and fibrosis of the 
eustachian tube mucosa secondary to inflammation have all 
been suggested as causative factors (Tsuji, Yamaguchi, and 
Moriyama, 2003). 

The subjective sensation of voice amplification is due to 
the fact that the voice is resonated in the middle ear cavity, 
having traveled up the open eustachian tube, which is sup- 
posed to remain closed except upon swallowing. Autophony 
may have been first described by Johannes Miller, who wrote 
in his Handbook of Physiology that he could produce a snap- 
ping sound in his ears when he contracted his palatine mus- 
cles and elevated his soft palate. He said: “If immediately 
afterwards I emit a humming vocal tone while my mouth is 
either closed or slightly open, this tone has an extraordinary 
resonance” (Perlman, 1939). 

Hyponasality as a sign and symptom of this syndrome 
(Landes, 1967) has an interesting purpose, chiefly protective; 
the patient keeps the velopharyngeal port closed to prevent 
the voice from reaching the eustachian tube, which opening 
lies above the level of the velopharyngeal sphincter. The rea- 
son the symptoms disappear with the head lowered or when 
the patient is lying down is that venous engorgement of the 
area surrounding the opening closes the eustachian tube. 

Treatment consists of (1) patient education as to the causes 
of the disorder, (2) weight gain, or (3) injection of Teflon or 
autologous fat at the orifice of the tube to close it. Anxiety, 
depression, and the general impression of psychoneurosis 
improve noticeably in patients treated with augmentation 
procedures. Crary, Wexler, and Berliner (1979) found that 9 of 
10 patients so treated said that they felt much better, had 
improved concentration, had increased enjoyment of life, and 
on the Minnesota Multiphasic Personality Inventory showed 
a decrease in the neurotic triad of hypochondriasis, depres- 
sion, and hysteria. Doherty and Slattery (2003) successfully 
treated refractory PET in patients with autologous fat graft 
plugging of the eustachian tube at its nasopharyngeal orifice 
in combination with myringotomy and ventilation tube 
placement. Nevertheless, PET presents a challenging man- 
agement problem because many methods are ineffective, and 
successful treatment methods are often temporary. 


64 Clinical Voice Disorders 





+ Mixed Nasality 


Mixed nasality, also called rhinolalia mixta, may seem para- 
doxical: simultaneous velopharyngeal insufficiency and 
nasal obstruction without one cancelling out the other. A 
mixture of hyponasality of nasals and hypernasality of vow- 
els nevertheless does occur in patients who have coexisting 
obstruction in the anterior or posterior nasal cavities simul- 
taneous with organic or nonorganic velopharyngeal insuffi- 
ciencies. 


+ Clinical Examination 


The following primary questions need to be answered about 
the patient suspected of a nasal resonatory disorder: 
1. Does a nasal resonatory disorder, in fact, exist? If so: 

+ What type is it? (hypernasality, hyponasality?) 

+ How severe is it? 

@ What is its cause? 

How should it be treated? 


2. With what physical, psychologic, and educational defi- 
ciencies is the resonatory disorder associated and what 
are their interactions? Are these combinations of factors 
indicative of need for genetic assessment? 


3. How can they be reduced or eliminated? 

To answer these questions, the following data are neces- 
sary: 
1. History of the primary and associated disorders 


2. Perceptual examination as to presence, type, and extent 
of the nasal resonatory disorder 


3. Examination of the speech anatomy and physiology via 
direct observation and indirect biophysical measurement 


. Articulation analysis 

. Language evaluation 

. Psychometric and psychologic evaluation 
Social work evaluation 


. Hearing evaluation 
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. Allied health specialists such as surgical, dental, and 
genetic consultative evaluations and recommendations 


Perceptual Examination 


Judgment as to the presence and severity of a nasal res- 
onatory disorder lies within the ear and mind of the speech 
pathologist and not a mechanical or electronic instrument, 
although such devices can support the clinical impression. 


Contextual Speech Impression 


Have the patient simulate everyday speech as closely as pos- 
sible. Depending upon age and cooperativeness, two kinds of 
connected speech samples can be obtained: conversational 
speech and oral reading. Simultaneous video and audio 
recordings may be made for confirmational analysis later. As 
the patient speaks, the clinician should try to answer the fol- 
lowing questions: 


1. Do I hear normal, excess, or insufficient nasal res- 
onation? 


2. How distracting is the resonatory disorder as I try to 
grasp the content of the patient’s speech? 


3. Are consonant pressures low so that the sounds are diffi- 
cult or impossible to identify? 


. Are vowels hypernasal? 

. Are nasal semivowels hyponasal? 

. Do I hear nonstandard sound substitutions? 
. Do I hear standard sound substitutions? 


. What is the level of overall speech intelligibility? 
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. Does language development sound delayed? 


10. Is a voice (laryngeal) disorder also present, and if so, 
what are its perceptual features? 


11. Does the patient’s behavior suggest a hearing loss? 


12. Does general behavior suggest intellectual, personality, 
or emotional disturbances? 


13. What is the impression given by the patient’s physical 
appearance? 


14. Do facial grimaces or other habit patterns accompany 
contextual speech? 


15. Does the speech pattern change with different rates of 
production, orality, or differences in contextual material? 


Semiobjective Tests for Hypernasality and Hyponasality 
Suppose, after listening to contextual speech, the clinician 
suspects hypernasality, nasal emission, or hyponasality. In 
the absence of aerodynamic or acoustic instrumentation, 
additional arguments for or against their presence can be 
obtained by means of the following informal observations or 
practical tests: 


1. Mirror-clouding test 
a. Hypernasality and nasal emission 


(1) Place a hand mirror or cold shiny metal object such 
as a Serving spoon under the patient’s nares. 


(2) Ask the patient to repeat a sentence that is free of 
all nasal semivowels (e.g., “We see three geese” or 
“The big black dog caught the stick”). People who 
have normal palatal and velopharyngeal closure 
should not leak air out the nose during such 
phrases. Fogging of the metal under one or both 
nares indicates velopharyngeal insufficiency. 
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b. Hyponasality 


(1) Ask the patient to repeat a sentence containing 
ample nasal semivowels (e.g., “Many men sang at 
noon”). 


(2) People with normal velopharyngeal opening 
should leak air out the nares during the nasal 
sounds, fogging the mirror. Little or no fogging 
indicates nasal obstruction. 


2. Nasal vibration test 
a. Hypernasality and nasal emission 


(1) While the patient repeats a sentence free of nasal 
sounds, such as “We see three geese,” with thumb 
and index finger lightly straddle the cartilaginous 
portion of the nose: if vibration can be felt, nasal 
resonation is likely. 


(2) Have the patient repeat the sentence, but this time 
pinch the nares shut: if cul-de-sac resonation can 
be heard, excess nasal resonation is likely. 


b. Hyponasality 


(1) While the patient repeats a sentence rich in nasal 
sounds, such as “Many men sang at noon,” straddle 
the nose with thumb and index finger: if vibration 
cannot be felt, suspect nasal obstruction. If pinch- 
ing the nares during the production of nasal 
sounds does not produce cul-de-sac resonation, 
also suspect nasal obstruction. 


3. Consonant differential pressure test for hypernasality and 
nasal emission 


a. Have the patient repeat a sentence free of nasal sounds 
and rich in pressure consonants, such as “The paperboy 
saw a baseball.” The first trial should be produced 
while the examiner listens carefully to consonant 
sharpness and sentence intelligibility. 


b. The second trial should be produced while the exam- 
iner pinches the nares shut and listens for any change 
in the consonant sharpness and intelligibility. A notice- 
able rise in audible consonant pressure or sharpness 
and intelligibility with the nares pinched indicates 
grossly the extent of the insufficiency. It also provides 
an impression of what the patient’s speech could be 
like with successful surgical or prosthetic treatment of 
the insufficiency. 


4. Nares occlusion test of exhalatory efficiency 


a. With nares open, ask the patient to inhale as deeply as 
possible and then on exhalation count to as high a 
number as possible until all breath is exhausted. (First 
demonstrate a rate of about two numbers per second.) 
Repeat this 3 times and take the average maximum 
number attained on the three trials. 


b. While the examiner pinches the nares shut, the patient 
repeats the same three trials of counting, again taking 
the average of the three trials. Patients who have nor- 
mal velopharyngeal closure produce little or no differ- 
ences between the maximum number attained on 


counting with nares open versus nares occluded. 
Patients who have hypernasality and nasal emission 
will produce a noticeably lower maximum number 
with the nares open versus closed, giving evidence of 
the extent of nasal air wastage. 


These practical tests can be done in the office without spe- 
cial equipment. Quantification of nasal airflow and intraoral 
pressures by electronic instruments can yield similar results. 
Whereas they may not be essential for experienced clini- 
cians, neophyte clinicians who are unsure of their ability to 
detect differences in nasal resonance often benefit from 
instrumental analysis. 


Examination of the Speech Anatomy 


Once the clinician has identified a nasal resonatory disorder 
by the methods just discussed, the next step is to find a phys- 
ical explanation for the hypernasality and nasal emission or 
hyponasality. For this portion of the examination, the follow- 
ing equipment is required: 

1. A quiet room 


2. A chair for the patient that will enable the examiner to 
look into the patient’s mouth at different angles; a dental 
chair is ideal. 


3. An excellent source of light. Although a good flashlight 
will suffice, natural lighting, dental or surgical lighting, or 
light reflected from a head mirror are even better. 


4. Tongue depressors 
5. Dental mirror 
6. Endoscopic equipment 


The physical examination begins by asking the patient to 
sit upright, relaxed and comfortable. 


1. While the patient is breathing quietly, the clinician should 
be thinking of the following questions: 


a. Are there friction noises that suggest nasal or oral 
obstruction? 


b. Is the patient a mouth breather? 


c. Are there unilateral, bilateral, or midline clefts of the 
upper lip? 


d. Is there evidence of surgical repair of clefts of the 
lip? 


e. Does the upper lip appear thin or contracted? 


f. Does one or both sides of the nose appear flattened or 
distorted? 


g. Does the maxilla appear abnormally small? 


h. Does the mandible appear incongruously large, pro- 
truding beyond the maxilla? 


i. Does the mandible appear incongruously small in 
comparison with the maxilla? 
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2. Voluntary lip movements 


a. Does the patient have difficulty approximating the 
upper and lower lips? 


b. Does the patient have difficulty rounding, retracting, 
and protruding the lips? 


3. Voluntary mandibular movements 


a. Does the patient have difficulty depressing, lateraliz- 
ing, protruding, or elevating the mandible? 


b. On occluding the teeth, does the mandible protrude 
beyond the maxilla? 


4. Dental configuration 
a. Are teeth missing in the upper or lower dental arch? 


b. Are teeth improperly formed or positioned within the 
dental arch? 


c. Is there malocclusion between the upper and lower 
teeth? 


d. Do extra teeth (supernumerary) crowd the dental arch? 


5. Hard palate configuration 
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. Is there a unilateral or bilateral cleft of the hard palate? 
b. Is there an alveolar cleft? 


c. Is there a notch in the posterior hard palate on palpa- 
tion? 


d. Is there a palatal fistula present? 
e. Is the hard palate abnormally high and narrow? 
f. Is the hard palate scarred from previous surgeries? 


g. Have teeth erupted from the hard palate? 


6. Soft palate configuration 


. Does the soft palate appear abnormally short at rest? 
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. Is the soft palate asymmetric at rest? 


. Is the soft palate scarred? 


a an 


. Are the palatine tonsils enlarged? 
. Is the uvula absent? 
Is the uvula bifid or split? 


. Is the soft palate thin and translucent? 


a= am Oo 


. Does the soft palate elevate asymmetrically on phona- 
tion of /a/? 


. Does the soft palate fail to elevate bilaterally on phona- 
tion? 


j. Does the soft palate fail to elevate bilaterally on stimu- 
lation of the gag reflex? 


k. On cineradiographic, videofluoroscopic, endoscopic, or 
nasal endoscopic examination: 


(1) Does the soft palate fail to contact the posterior 
pharyngeal wall on non-nasal sounds? 


(2) Does the soft palate appear short? 


3) Does the soft palate fail to show a knee-angle dur- 
ing phonation? 


4) Does the soft palate contact a Passavant’s ridge on 
phonation? 


5) Does the presence of adenoids provide a point of 
contact by the soft palate on non-nasal sounds? 


6) Does the velopharyngeal sphincter fail to close? 


7) Is there a reduction or absence of medial pharyn- 
geal wall movement? 





8) What is the pattern of closure? 


Articulation Analysis 


Articulatory analysis is done for three reasons in patients 
with palatopharyngeal insufficiency: to determine the pres- 
ence or extent of deviant articulatory development; to iden- 
tify distortions secondary to orofacial and dental anomalies; 
and to identify compensatory, nonstandard articulatory 
substitutions and distortions. These objectives are separate 
and distinct. It is important to objectify the presence of 
impaired articulatory development, which may or may not 
be caused by the palatopharyngeal insufficiency. It may be 
due to mental retardation, hearing loss, environmental dep- 
rivation, or any other factor that affects speech production. 
The second and third reasons for articulation testing are 
that cleft palate and other palatopharyngeal insufficiencies 
produce their own special articulatory defects. For example, 
dental, maxillary, and mandibular anomalies markedly 
interfere with normal articulation, and, because no two 
individuals have the same anatomic abnormalities, their 
articulatory acoustics are never identical. Hence, no stan- 
dard nomenclature or precise phonetic symbols exist to 
indicate the peculiarity of a given individual’s distortions 
caused by anomalous oral architecture. The same can be 
said for the compensatory glottal, pharyngeal, and lingual 
stops and fricatives so common in children and adults with 
cleft palate. In short, the ordinary developmental or 
descriptive articulation test for assessing general articula- 
tory development needs to be supplemented to document 
the articulatory defects in this population that extend 
beyond delayed development. 

Addressing the special needs of the cleft palate popula- 
tion, Bzoch (1979) designed an Error Pattern Diagnostic 
Articulation Test (Fig. 4.16). In addition to simple substitu- 
tions, omissions, and distortions, this test adds recording 
space for gross substitutions, such as glottal stops, and pha- 
ryngeal fricatives, and for nasal emissions of sounds. The cli- 
nician should also describe in detail the location of the error, 
particularly the nonstandard distortions and substitutions 
produced by atypical labial, dental, lingual, pharyngeal, or 
laryngeal movements. 


Language Evaluation 


Children with cleft lip and palate are known for their delayed 
expressive language in addition to delayed articulatory devel- 
opment. Confronting the clinician is the need to establish 
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Figure 4.16 Error pattern diagnostic 
articulation test. (From Bzoch, K.R. 
[1979]. Communicative disorders re- 
lated to cleft lip and palate. Boston: 
Little, Brown, p. 169.) 


whether language is delayed and why, a more than usually 2. Dental anomalies. Missing and malpositioned teeth and 


complicated question within the context of cleft palate, for 
such delay often has multiple causes. Consequently, evalua- 
tion must consider not only documentation of language 
developmental milestones but also audiologic, familial, surgi- 


malocclusions have the same effect of depriving the child 
of normal anatomy for speech development. 


3. Trauma. Oropharyngeal pain from surgical procedure 


cal, medical, social, and psychologic history as well. The clini- 4. Emotional deprivation. Rejection by parents and sibling 


cian who works with children with craniofacial anomalies 
needs to be aware of the physical, environmental, and psycho- 


logic disturbances that can contribute to delayed language 
development. 6. Hearing loss. From serous otitis media and middle ear 


For example: infection 


1. The cleft itself. Inability to generate intraoral pressures 
within the first 2 or even 3 years of life, a period having 


5. Language deprivation. 
because of hospitalizations and family disruption 


Reduced language stimulation 


7. Developmental disabilities. As an associated disorder not 
stemming from environmental factors per se 


implications for pressure-consonant development, in itself 8. Self-consciousness. Reduced willingness to speak to min- 
probably delays not only articulation but also intelligibility. imize conspicuousness 
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Therapy for nasal resonatory disorders and cleft palate 
rehabilitation have not been included in this chapter owing 
to their extensiveness and complexity. This subject is cov- 
ered in the references listed in the suggestions for Additional 
Reading section at the end of this chapter. 


¢ Summary 


+ Hypernasality and nasal emission interfere with the aes- 
thetics of speech and can seriously compromise intelligi- 
bility, producing serious psychologic, educational, and 
occupational consequences. 


+ Hypernasality and nasal emission are caused by cleft 
palate, congenitally short palate or large nasopharynx, 
nasopharyngeal trauma, and by neurologic diseases that 
cause weakness and dyssynchronous movements of the 
velopharyngeal sphincter. 


+ Hyponasality, or denasality, is caused by obstruction of 
the nasopharynx or nasal passages. 
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Chapter 5 


Neurologic Voice Disorders 


Highly integrated neurophysiologic control is necessary for 
normal phonation. The vocal folds must be adducted to the 
midline and kept there with balanced and bilaterally symmet- 
ric adductor-abductor muscle tonus. Thyroarytenoid and 
cricothyroid muscle tension must be optimum. Glottal open- 
ing and closing has to be precisely timed, the folds adducted at 
the exact moment for onset of voiced consonants and vowels 
and abducted for voiceless. Should they adduct too soon, 
voiceless sounds would be voiced; too late, voiced sounds 
would be voiceless. Yet they must not overadduct, for then 
they would obstruct the exhaled air stream and produce 
strained voice or voice arrest. Should they fail to adduct or 
suddenly abduct, voicing would become breathy or whis- 
pered. The extrinsic muscles of the larynx must be able to 
elevate and depress the larynx in the neck, otherwise pitch 
variation will be restricted. On the other hand, if fluctuations 
are excessive, pitch will change unexpectedly and inappropri- 
ately. Exhalatory airflow must be forced from the lungs under 
constant pressure from moment to moment, for a sudden 
increase or decrease in such pressure may adversely produce 
irregular pitch and loudness changes. Air must be forced from 
the lungs with adequate pressure, otherwise the voice will be 
insufficiently loud, and at other times air must escape under 
low pressure for low loudness levels. 

When vascular, infectious, traumatic, neoplastic, and 
degenerative diseases damage regions of the nervous system 
that contain nerve cells responsible for phonation, abnormal 
voice is inevitable. Previous research has identified acousti- 
cally different dysarthrias and their associated dysphonias 
caused by damage to specific regions of the nervous system 
(Darley, Aronson, and Brown, 1969a,b). Flaccid, spastic, 
ataxic, hypokinetic, and hyperkinetic dysarthrias sound dif- 
ferent because physiologically and anatomically dissimilar 
regions of the nervous system are damaged in each type, 
producing different muscular pathophysiology. Neurologic 
voice disorders exhibit the same relationship between 
anatomic location of lesion and acoustic effect as the 
dysarthrias. In fact, neurologic voice disorders, technically, 


are dysarthrias; although they can occur in isolation, most 
often they are embedded in a more widespread complex of 
respiratory, resonatory, and articulatory dysarthric signs. 
Dysphonia can be the first sign of neurologic disease, the 
remainder of the dysarthria following as the disease pro- 
gresses. Such capricious and discrepant patterns of neuro- 
logic disease onset are common. 

There are, then, as many types of neurologic voice disor- 
ders as there are dysarthrias: 


1. Flaccid (lower motor neuron) dysphonia 


2. Spastic (upper motor neuron) (pseudobulbar) (supranu- 
clear, corticobulbar tract) dysphonia 

3. Mixed flaccid-spastic pseudobulbar dysphonia 

4. Hypokinetic (basal ganglia) (parkinsonian) dysphonia 

5. Ataxic (cerebellar) dysphonia 

6. Choreic (basal ganglia) dysphonia 

7. Dystonic (basal ganglia) dysphonia 

8. Dysphonia of palatopharyngolaryngeal myoclonus 


(brain stem) 
9. Dysphonia of organic (essential) tremor (brain stem) 
10. Dysphonia of Gilles de la Tourette’s syndrome 
11. Apraxia of phonation 
12. Akinetic mutism 


13. Foreign accent syndrome 


The voices of these neurologic voice disorders tend 
toward either constancy or variability of phonation 
depending upon whether or not the pathophysiology of 
the disease produces relatively steady or relatively fluctu- 
ating abnormal laryngeal or respiratory muscle move- 
ments. Neurologic voice disorders are subdivided into the 
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N 
3 Table 5.1 


Level of 
Lesion 


|. (Fig. 5.3A) 
Above origin 
of pharyngeal, 
superior 
laryngeal, 
and recurrent 
laryngeal nerves 


. (Fig. 5.3B) 
Above origin 

of superior 
aryngeal and 
recurrent 
aryngeal nerves 
but below origin of 
pharyngeal nerve 
. (Fig. 5.3C) 
Superior 
aryngeal 
nerve 





IV. (Fig. 5.3D) 
Recurrent 
laryngeal 
nerve 


V. (Fig. 5.3E) 
Myoneural 
junction 
(myasthenia 
gravis) 


Effect on Vocal Folds 


Unilateral 
Lesion 


One vocal 
fold fixed 
in abducted 
position 


Same as 
above 


Both vocal folds 
able to adduct, 
affected vocal 
fold shorter, 
asymmetric 
shift of epiglottis 
and anterior 
larynx toward 
intact side on 
phonation 


One vocal 
fold fixed in 
paramedian 
position 


Not applicable 


Bilateral 
Lesion 


Both vocal 
folds fixed 
in abducted 
position 


Same as 
above 


Absence of 

tilt of thyroid on 
cricoid cartilage, 
inability to view 
full length of 
vocal folds 
because of 
epiglottic over- 
hang, vocal 
folds bowed 


Both vocal 
folds fixed in 
paramedian 
position 


Restriction of 
adductor- 
abductor 
movements 


Effect of Lesions of the Vagus Nerve on Phonation and Resonation 


Effect on Phonation 


Unilateral 
Lesion 


Breathy, 
moderate, 
reduced 
loudness 
and pitch 


Same as 
above 


Breathy, 
hoarse 


Breathy, 
hoarse, 
reduced 
loudness, 
diplophonia 
(not in all 
cases) 


Not applicable 


Bilateral 
Lesion 


Extremely 
breathy to 
whispered 
(aphonia) 


Same as 
above 


Breathy, hoarse, 
reduced loud- 
ness, restricted 
pitch range 


Breathy, 
hoarse, 
reduced 
loudness 


Breathy, hoarse, 
reduced loud- 
ness; symptoms 
worsen with 
sustained 
speaking 


Effect on Soft Palate 


Unilateral Bilateral 

Lesion Lesion 

One side Both sides 
low, low, 
immobile immobile 

None None 

None None 

None None 

Not Both sides 
applicable low, 

immobile 


Source: Abstracted from Rontal, M., Tontal E. (1977). Lesions of the vagus nerve: Diagnosis, treatment and rehabilitation. Laryngoscope 87, 72-86; and from Ward, P.H 
nerve paralysis: An often overlooked entity. Trans Am Acad Ophthalmol Otolaryngol 84, 78-79. 


Effect on Nasal 
Resonation 


Hypernasality, 
nasal 
emission 


None 


None 


None 


Hypernasality, 
nasal emission; 
symptoms 
worsen 
with sustained 
speaking 


Associated 
Signs 


Glottal 


Sa 


coup and cough 
absent, weak,or 
mushy; difficulty 

in swallowing; 

nasal regurgitation 
of food; aspiration 
of secretions; 
pharyngeal paralysis 
me as above, except 
no pharyngeal 
paralysis or 
difficulty in 
swallowing 





None 


Unilateral: Marginal 


Bil 


Di 


airway, weak cough. 
ateral: Severe 
difficulty on inhalling 
for life purposes, 
inhalatory stridor, 
tracheostomy often 
necessary. 


fficulty in swallo- 
wing, nasal 
regurgitation of 
food, inhalatory 
stridor, articulation 
defects 


., Berci, G., Calcaterra, J.C. (1977). Superior laryngeal 
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following general categories, under which are subsumed 
specific neurologic dysphonias and aphonias (Table 5.1): 


1. Relatively constant neurologic voice disorders Abnormal 
voice quality, loudness, or pitch deviations are relatively 
constant or consistent during contextual speech and 
vowel prolongation. Fluctuations may be major or minor. 
This category includes flaccid, spastic (pseudobulbar), 
mixed flaccid-spastic, and hypokinetic (parkinsonian) 
dysphonias. 


2. Arrhythmically fluctuating, neurologic voice disorders 
Abnormal muscular physiology produces unpredictable 
and irregular quality, loudness, and pitch fluctuations 
from moment to moment during contextual speech, 
which are especially prominent during vowel prolonga- 
tion. Included are ataxic, choreic, and dystonic dysphonias. 


3. Rhythmically fluctuating, neurologic voice disorders 
Abnormal voice fluctuates relatively rhythmically from 
moment to moment and is particularly noticeable during 
vowel prolongation. In this group are included the dys- 
phonias of palatopharyngolaryngeal myoclonus and 
organic (essential) (heredofamilial) voice tremor. 


4. Paroxysmal neurologic voice disorders Relatively infre- 
quent, sudden bursts of aberrant voice are exemplified in 
Gilles de la Tourette’s syndrome. 


5. Neurologic voice disorders associated with loss of voli- 
tional phonation The following neurologic voice disor- 
ders are not classifiable as dysarthrias but fall under the 
umbrella of (a) apraxia of speech, including apraxia of 
phonation (no loss of muscle strength or coordination but 
loss of volitional control over laryngeal and respiratory 
movements); (b) akinetic mutism; (c) foreign accent syn- 
drome; and (d) frontal lobe syndrome. 


¢ Neurologic Voice Disorders in Infants 


Infant neurologic voice disorders cannot be classified 
according to the above schema because of limited knowl- 
edge of the relationship between abnormal voice and the 
neuroanatomic location of lesions in this age group. In 
general, however, newborn infants with central nervous 
system (CNS) lesions of unspecified location cry in an 
excessively high-pitched, shrill, weak, and unsustained 
voice. They require more intense and frequent pain stimuli 
to induce crying, and their cries are of abnormally short 
duration. For example, Karelitz and Fisichelli (1962) found 
that only 54% of neurologically impaired infants met the 1- 
minute cry duration criterion, in comparison with 92% of 
normal infants. Such infants produced statistically signifi- 
cantly fewer total sounds. Some were hypernasal (Fisichelli, 
1966). Cyanotic infants cried with delayed latencies (Kare- 
litz and Fisichelli, 1962), had fundamental frequencies 
twice as high as normal infants, and cried longer than nor- 
mal infants (Lind, Wasz-H6éeckert, Vuorenkoski, et al., 
1965). The infants of drug-addicted mothers cried at abnor- 


mally high fundamental frequencies (Blinick, Tavolga, and 
Antopol, 1971). 

Infants born with unilateral or bilateral vocal fold paraly- 
ses due to cranial nerve X (vagus) lesions also cry abnor- 
mally. Such injury is caused by twisting the neck during 
breech presentations, infections, vascular diseases, and cra- 
nial malformations. The left recurrent laryngeal branch of 
the vagus nerve is more frequently damaged than the right 
because of its longer course in the chest and, therefore, 
greater exposure to disease or damage from an enlarged 
heart, tumors, mediastinal cysts, and tracheobronchial tree 
malformations. The dysphonia may be solitary or one of sev- 
eral cranial nerve signs. The cries of infants with unilateral 
adductor vocal fold paralysis are either weak or absent. In 
bilateral abductor paralysis, the voice and cry may be para- 
doxically normal because of adequate approximation of the 
vocal folds, but such infants produce inhalatory stridor dur- 
ing crying because their vocal folds are too weak to abduct 
during inhalation. 


+ Neurologic Voice Disorders in Children 
and Adults 


Relatively Constant Neurologic Voice Disorders 
Flaccid Dysphonia: Tenth Cranial Nerve (Vagus) Lesions 


Cell bodies of the vagus nerves, which supply the larynx and 
pharynx, are found deep within the reticular formation of 
the medulla in the nucleus ambiguous. Experimental evi- 
dence in monkeys indicates that the superior third of the 
nucleus ambiguus is responsible for the pharynx and esoph- 
agus; caudal to it is a center for cricothyroid muscle func- 
tion; and caudal to that is a center for abductor laryngeal 
muscle function. The most caudal region is responsible for 
adductor muscle function. A cluster of cells ~2 cm long con- 
tains the cell bodies of cranial nerves IX, X, and XI. The vagus 
nerve emerges from the lateral surface of the medulla 
between the inferior cerebellar peduncle and the inferior 
olive. A series of discrete roots converge into a single trunk 
intracranially. Each vagus nerve trunk passes through paired 
openings, the jugular foramina, at the base of the skull, along 
with cranial nerves IX and XI (Fig. 5.1). Near its exit from the 
skull, each vagus nerve divides into three branches: 


1. Pharyngeal nerve The pharyngeal branch becomes 
identifiable as it emerges from the upper part of the 
inferior (nodos) ganglion of the vagus nerve and 
descends between the external and internal carotid 
arteries to the left of the middle pharyngeal constrictor 
muscles. At that point, it again divides into numerous 
filaments that join with branches from the sympathetic 
trunk and the glossopharyngeal and external laryngeal 
nerves to form the pharyngeal plexus. From there, nerve 
fibers are distributed to the pharynx and to all the mus- 
cles of the soft palate except the tensor veli palatini, 
which is supplied by the motor division of cranial nerve 
V (trigeminal). 
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Figure 5.1 Pathway of the vagus nerve from brain stem to larynx. 
R-I-L-M, remaining intrinsic laryngeal muscles. 


2. Superior laryngeal nerve This branch of the vagus nerve 
emerges from the inferior ganglion and descends along- 
side the pharynx, first posterior to the internal carotid 
artery and then medial to it. About 2 cm below the infe- 
rior ganglion, it divides into two additional branches: the 
internal and external laryngeal nerves. 


a. Internal laryngeal nerve This branch of the superior 
laryngeal nerve descends to the level of the thyrohyoid 
membrane and, piercing and entering it, further 
divides into two additional branches. Both contain 
afferent or sensory fibers from mucous membrane that 
lines the larynx above the level of the vocal folds. They 
also contain fibers from muscle spindles and other 
stretch receptors in the larynx. The upper branch of the 
internal laryngeal nerve supplies the mucous mem- 
brane of the epiglottis, the vallecula, and the vestibule 
of the larynx. The lower branch supplies the mucous 
membrane of the aryepiglottic folds and the dorsum of 
the arytenoid cartilages. 


b. External laryngeal nerve This efferent or motor 
branch of the superior laryngeal nerve descends poste- 
rior to the sternothyroid muscle and innervates the 
cricothyroid muscle. It also branches to the inferior 
pharyngeal constrictor. 


. Recurrent laryngeal nerve The term recurrent refers to 
the anatomic course of this nerve, which first descends 
to the lower neck and into the chest and then courses 
superiorly again. This nerve pair, unlike the rest that 
supply the larynx, is not bilaterally symmetric (Fig. 5.2). 
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Figure5.2 Asymmetric pathways of the left and right recurrent laryn- 
geal nerves. 


The right recurrent laryngeal nerve arises from the vagal 
trunk, it ascends alongside the trachea behind the com- 
mon carotid artery and, higher up, in or near the groove 
between the trachea and esophagus, entering the larynx 
behind the articulation between the inferior horn of the 
thyroid and the cricoid cartilage. In 70 to 80% of cases, 
the nerve divides into two or more branches before 
entering the larynx. The left recurrent laryngeal nerve 
splits off from the trunk of the vagus to the left of the 
arch of the aorta, winding under it from front to back. 
Ascending lateral to the trachea, it then pierces and 
enters the larynx, as on the right side. Both right and left 
recurrent laryngeal nerves supply all intrinsic muscles of 
the larynx except the cricothyroid. They also supply sen- 
sory filaments to the mucous membrane lining the lar- 
ynx below the level of the vocal folds, and they carry 
afferent fibers from stretch receptors to the intrinsic 
laryngeal muscles. 


The extrinsic laryngeal musculature is also implicated in 
lower motor neuron disease, although to a much lesser 
extent than the intrinsic muscles. Of the suprahyoid mus- 
cles, the anterior belly of the digastric is innervated by the 
mylohyoid branch of the inferior alveolar nerve; the poste- 
rior belly by cranial nerve VII; the stylohyoid by cranial 
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nerve VII; the mylohyoid by the mylohyoid branch of the 
inferior alveolar nerve; and the geniohyoid by the first cer- 
vical spinal nerve (C1) via the hypoglossal nerve. The 
infrahyoid muscles—the sternohyoid, sternothyroid, and 
omohyoid—are all innervated by the ansa cervicalis, and the 
thyrohyoid is innervated by fibers from C1 via the hypoglos- 
sal nerve. 

The autonomic nerve supply to the larynx originates in 
the dorsal motor nuclei of the vagus as parasympathetic 
fibers that synapse in the inferior ganglion and accompany 
the somatic motor nerves to the larynx. Sympathetic nerve 
fibers enter the motor nerves through connections with 
the superior cervical ganglion. Autonomic nerve fibers have 
chiefly secretory and vasomotor functions and maintain 
muscle tonus. 
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Effects of Tenth Cranial Nerve Lesions on Vocal Fold 
Movement and Phonation Lesions of the tenth cranial 
nerve at any point along its pathway from the nucleus 
ambiguus in the brain stem to the musculature cause flaccid 
paresis (weakness) or paralysis (immobility) of laryngeal 
muscles as well as dysphonia or aphonia. The extent of 
weakness, the position of vocal fold fixation, the unilaterality 
or bilaterality of the weakness, and the degree of voice 
impairment depend upon the location of the lesion along the 
pathway of the nerve and whether one or both nerves of the 
pair have been damaged. Refer to Fig. 5.3 and Table 5.1 for 
the following discussion. 


1. Intramedullary and extramedullary lesions affecting all 
branches of the vagus nerve to the larynx (Fig. 5.3A). 
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Figure 5.3 (A) Effects of unilateral and bilateral vagus nerve lesions at different locations on vocal fold and soft palate functions. Stippled line indi- 
cates lesions. (See Table 5.1 for corresponding information.) Top: Unilateral lesion of vagus nerve (nucleus), above origin of pharyngeal, superior laryn- 
geal, and recurrent laryngeal nerves. The right vocal fold is fixed in an abducted position, whereas the left adducts to the midline on phonation. The 
soft palate is paralyzed on the right, is resting, and pulls to the left on phonation. Bottom: Bilateral lesion. Both vocal folds are fixed in an abducted 
position on phonation. The soft palate is bilaterally paralyzed, is resting low, and does not move on phonation. R-I-L-M, remaining intrinsic laryngeal 
musculature. (B) Top: Unilateral lesion of vagus nerve above origin of superior laryngeal and recurrent laryngeal nerves but below bifurcation of pha- 
ryngeal nerve. Same effect on vocal folds as (A; top), but soft palate functions normally. Bottom: Bilateral lesion. Same effects on vocal folds as (A; bot- 


tom), but soft palate functions normally. (Continued on page 74) 
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Lesions that damage vagal nuclei within the brain stem 
are called intramedullary. When the nerve trunk just 
outside the brain stem but still within the cranial cavity 
is damaged, the lesion is called extramedullary. If the 
lesion affects the nerve after it exits from the skull, it is 
called extracranial. Lesions at any of these three levels 
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Figure 5.3 (Continued) (C) Top: Unilateral lesion of superior 
laryngeal nerve. Both vocal folds adduct, but the anterior larynx may 
twist toward the intact side on phonation. The soft palate is normal. 
Bottom: Bilateral lesion. Both vocal folds adduct but are partially 
obscured by epiglottic overhang. Vocal folds are bowed. The soft 
palate is normal. (D) Top: Unilateral lesion of recurrent laryngeal 
nerve. One vocal fold is fixed in a paramedian position on phonation, 
whereas the other adducts to the midline. The soft palate is normal. 
Bottom: Bilateral lesion. Both vocal folds are fixed in a paramedian 
position on phonation. The soft palate is normal. (E) Myoneural 
junction lesion (myasthenia gravis). There is bilateral paresis or 
paralysis of both adductory and abductory vocal fold movements 
on phonation and reduced or absent soft palate function. 


damage the vagus nerve above the separation of its pha- 
ryngeal, superior laryngeal, and recurrent laryngeal 
branches. Therefore, all muscles supplied by those 
branches of the vagus nerve below the level of the lesion 
will be weak or paralyzed. Affected muscles then will be 
one of three types: (1) those supplied by the pharyngeal 


branch of the vagus nerve. Consequently, the levator 
muscle of the soft palate, the levator veli-palatini, will be 
weak or paralyzed on the same side as the lesion. 
(2) Because the superior laryngeal branch also lies below 
the level of the lesion, the cricothyroid muscle will be 
weak or paralyzed on the side of the lesion. (3) Because 
the recurrent laryngeal branch lies below the level of the 
lesion, the remaining intrinsic muscles of the larynx will 
be weak or paralyzed on the side of the lesion. The vocal 
folds will be paralyzed and fixed in the abducted posi- 
tion unilaterally or bilaterally. At rest and during phona- 
tion, the soft palate will hang lower on the paralyzed 
side, the nonparalyzed side will elevate, pulling the 
uvula and the midline of the soft palate toward the nor- 
mal side. In bilateral lesions, the soft palate will rest at a 
lower than normal position and on phonation will ele- 
vate minimally or not at all bilaterally. 

Unilateral lesions above the bifurcation of the pharyn- 
geal branch of the vagus nerve causing unilateral vocal fold 
paralysis will produce severe breathiness to whispered 
voice and diplophonia and, in some instances, a flutter or 
rapid tremor on vowel prolongation. There will be reduced 
loudness and pitch and possibly falsetto pitch breaks. 

In bilateral vocal folds paralysis where both folds are 
abducted, the voice will be virtually whispered, owing to 
the wider glottal chink than is found in unilateral paraly- 
sis and markedly reduced loudness. Unilateral soft palate 
paralysis will produce mild-to-moderate hypernasality 
and nasal emission, which will be much more severe if 
the paralysis is bilateral. 

Associated signs will be a weak, mushy, or absent glot- 
tal coup or cough, dysphagia, and nasal regurgitation on 
swallowing liquids and solids. Pharyngeal paralysis 
results in a reduced or absent gag reflex and aspiration of 
secretions, which, if severe, may require tracheostomy. 
Diseases that cause tenth cranial nerve lesions at this 
level are of several types: vascular, such as brain-stem 
hemorrhage, thrombosis, and arteriovenous malforma- 
tion; traumatic; primary or metastatic neoplastic; 
congenital defects of bone, such as Arnold-Chiari mal- 
formation; inflammatory, such as poliomyelitis and Guil- 
lain-Barré syndrome; degenerative, such as amyotrophic 
lateral sclerosis (ALS); metabolic, exemplified by myas- 
thenia gravis; toxic, such as metal (arsenic) or organic 
(botulism) poisoning; and others such as multiple sclero- 
sis, diphtheria, and tetany. 


. Extracranial lesions of the superior and recurrent laryn- 
geal nerves, but not the pharyngeal nerve (Fig. 5.3B). 
Extracranial lesions high in the neck can spare the pha- 
ryngeal branch of the vagus but damage the nerve above 
the origin of the superior and recurrent laryngeal 
nerves. Because the pharyngeal nerve lies above the 
level of the lesion, the muscles of the soft palate and 
pharynx will be normal. However, because the superior 
and recurrent laryngeal branches lie below the level of 
the lesion, the cricothyroid and remaining intrinsic 
muscles of the larynx will be paralyzed, having the 
same effects on the vocal folds and voice as described in 
the previous section. 
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Causes of lesions below the bifurcation of the pharyn- 
geal nerve but above the superior laryngeal include surgi- 
cal trauma and infectious and idiopathic disease. The 
latter are discussed in greater detail later in this chapter. 


. Extracranial lesions of the superior laryngeal nerve but 


not the pharyngeal and recurrent laryngeal nerves (Fig. 
5.3C). A lesion of the superior laryngeal nerve causes 
weakness or paralysis of the cricothyroid muscle. Because 
the pharyngeal and recurrent laryngeal branches are 
spared, the soft palate and remaining intrinsic laryngeal 
muscles are normal. On laryngoscopic examination, the 
vocal folds appear deceptively normal. Consequently, 
dysphonia due to cricothyroid muscle weakness is often 
either missed or misdiagnosed as “functional.” In unilat- 
eral cricothyroid muscle paralysis, both vocal folds 
appear to abduct normally on phonation, but on closer 
scrutiny the vocal fold on the affected side may appear 
shorter, and an asymmetric lateral shift of the epiglottis 
and anterior larynx toward the intact side may be noted. 
The notion that unilateral cricothyroid paralysis always 
causes an observable twisting has recently come into 
question suggesting this movement pattern is not always 
present. In bilateral cricothyroid paralysis, there will be 
an absence of tilt of the thyroid cartilage on the cricoid 
cartilage during phonation, the vocal folds will appear 
shorter than normal, the epiglottis will overhang and 
obscure the anterior portion of the vocal folds, and there 
will be bowing of the vocal folds (Fig. 5.4). The soft 
palate will be normal because the pharyngeal branch of 
the vagus nerve has been spared. Unilateral cricothyroid 
muscle paralysis produces mild breathiness or hoarse- 
ness, normal or mildly reduced loudness, and mild 
inability to alter pitch, interfering with singing in the 
upper part of the range and vocal projection. In bilateral 
cricothyroid muscle paralysis, breathiness and hoarse- 
ness will be mild to moderate, loudness will be reduced, 
and ability to alter pitch will be moderately to severely 
impaired, causing serious interference with singing. The 
cause of cricothyroid muscle paralysis is usually surgery, 
other trauma, or infection. 


. Extracranial lesions of the recurrent laryngeal nerve but 


not the pharyngeal and superior laryngeal nerves (Fig. 
5.3D). All muscles supplied by the recurrent laryngeal 
nerve are paralyzed, whereas those supplied by the 
remaining branches of the vagus, namely, the cricothyroid 
muscle and soft palate, are spared. The vocal folds are 
fixed in the paramedian position unilaterally or bilater- 
ally. It is important to understand why the vocal folds are 
not paralyzed in the abducted position, as in the case of a 
lesion above the bifurcation of the superior laryngeal 
branch: the intact cricothyroid muscle, still capable of 
stretching the vocal fold anteroposteriorly, acts as an 
adductor, thereby pulling the vocal fold closer to the mid- 
line. Lesions above the level of the superior laryngeal 
nerve result in fixation of the vocal fold more laterally, in 
the abducted position (causing virtual aphonia), because 
the cricothyroid muscle as well as the remaining intrinsic 
muscles is paralyzed and unable to exert its abducting 
influence. In bilateral paralysis of the vocal folds due to 
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Figure 5.4 Frames from motion picture photographs of vocal folds in 
bilateral cricothyroid muscle paralysis due to superior laryngeal nerve 
lesions. Absence of anterior tilt failing to lengthen and tense the vocal 
folds is noted by failure of anterior movement of the epiglottis, making 
it difficult to visualize the anterior commissure. Also visible is failure of 
complete vocal fold approximation on adduction due to bowing. (From 
Ward, P.H., Berci, G., Calcaterra, T.C. [1977]. Superior laryngeal nerve 
paralysis: an often over-looked entity. Trans Am Acad Ophthalmol Oto- 
laryngol 84, 78-89.) 


recurrent laryngeal nerve lesions only, the vocal fold rests 
so close to the midline that phonation is virtually normal. 
Paralysis of the abductors prevents widening of the glottis 
on inhalation, however, resulting in inhalatory stridor and 
compromising the airway. Breathy-hoarse quality, reduced 
loudness, and in some cases diplophonia and falsetto pitch 
breaks are heard in unilateral vocal fold paralysis from 
recurrent laryngeal nerve lesions. Associated signs of uni- 
lateral vocal fold paralysis are a marginal airway and a 
weak cough or glottal coup. In bilateral paralysis, the air- 
way is severely compromised, resulting in respiratory dis- 
tress often requiring tracheostomy. 

Conditions that cause recurrent laryngeal nerve lesions 
include trauma due to surgery, tumors of the nerve itself 
or impinging upon the nerve, vascular disease, infection, 
metabolic diseases, and idiopathic disorders. 

Occasionally, inhalatory stridor can be mistaken for 
abductor vocal fold paralysis or an asthmatic attack when, 
in fact, these events, in all probability, are psychogenic 
(Appelblatt and Baker, 1981; Cormier, Camus, and 
Desmeules, 1980; Kellman and Leopold, 1982; Patterson, 
Schatz, and Horton, 1974; Rogers, 1980; Rogers and Stell, 
1978) or paradoxical vocal cord dysfunction. Christopher, 
Wood, Eckert, et al. (1983) reported on five patients who 
had dramatic episodes of wheezing and who had been 
diagnosed as having uncontrolled asthma, symptoms that 
continued despite aggressive pharmacologic therapy. 


Laryngologic, psychiatric, and speech pathology studies, 
however, proved that these patients had a nonorganic dis- 
order of the larynx that mimicked bronchial asthma. Indi- 
rect laryngoscopy disclosed that the asthmatic-like 
sounds were coming not from the lungs but from an 
unusual mode of glottic closure on inhalation and exhala- 
tion during tidal breathing. Psychiatric examinations indi- 
cated that all of the patients were unaware of their upper 
airway obstruction and thought that they were having 
bronchial asthmatic attacks. They were unable to repro- 
duce the abnormal sound voluntarily. In all patients, treat- 
ment by a speech pathologist was effective, and after 3 
to 21 months of follow-up, these patients were asymp- 
tomatic. This problem described by Christopher and 
colleagues is what we now recognize as vocal cord dys- 
function, or VCD. It is considered to have many etiologies 
including redundant tissue, anxiety, and damage to the 
brain stem. What must be borne in mind, however, is that 
patients with extrapyramidal movement disorders also 
can produce laryngeal stridor at rest during tidal breathing 
because of adventitious adduction of the vocal folds as a 
component of their movement disorder. Moreover, sudden 
paroxysmal inhalation often accompanies these stri- 
dorous inhalations. 


. Myoneural junction disease: myasthenia gravis (Fig. 5.3E). 


The reduced availability of acetylcholine at the myoneural 
junction in myasthenia gravis causes progressive flaccid 
weakness or paralysis with strenuous muscular effort, as 
in prolonged voice use. Sometimes, muscles supplied by 
the vagus nerve are the first to be affected by this disease. 
When it is confined to the larynx and soft palate, the signs 
often erroneously lead the examiner into nonorganic 
avenues of diagnostic thinking. Because myasthenia 
gravis causes bilateral weakness of the intrinsic laryngeal 
muscles, the vocal folds abduct incompletely, sometimes 
bowing during phonation. Because abduction may be 
incomplete as well, inhalatory stridor occurs in more 
severe cases. The soft palate elevates minimally or not at 
all on phonation, resulting in hypernasality and nasal 
emission. The voice of patients with myasthenia gravis is 
breathy, sometimes hoarse, and flutter or tremor-like 
fluctuations on vowel prolongation may be heard. Loud- 
ness is usually reduced, and there is a restriction of pitch 
range. All defects become progressively worse with pro- 
tracted speaking. 

Most often, myasthenia gravis produces the full comple- 
ment of flaccid dysarthria: consonant imprecision, breath- 
iness, hypernasality, and nasal emission due to associated 
velar, tongue, lip, mandibular, and respiratory muscle 
weakness. The cough and glottal coup are weak or mushy, 
depending on the severity of the disease at the time of 
examination. The etiology of myasthenia gravis has been 
described as a defect in the neurochemistry at the 
myoneural junction, specifically a failure of acetylcholine 
production. However, recent research gives evidence of an 
autoimmune mechanism responsible for the disorder. 


. Disease of muscle. Muscular dystrophies or myopathies can 


cause flaccid dysphonia. The three types of muscular dys- 
trophies are the pseudohypertrophic (Duchenne), the limb- 
girdle, and the facioscapulohumeral (Landouzy-Dejerine). 
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Their general features are a family history of disease, early 
age of onset, and atrophy or dystrophy of muscle. A hered- 
ofamilial type of myopathy is myotonic dystrophy, in 
which there is selective atrophy of muscles, including the 
speech musculature, resulting in flaccid dysarthria, 
including dysphonia. The masticatory and facial muscles 
may be especially involved. The disease is progressive and 
appears for the first time in adulthood, between the ages 
of 20 and 30 years. 


7. Idiopathic vocal fold paralysis. Some vocal fold paralyses 
defy diagnosis; that is, a specific etiology cannot be dis- 
covered despite thorough investigation. These are called 
idiopathic! vocal fold paralyses. Williams (1959) reviewed 
the records of patients with vocal fold paralysis over a 
6-year period; 66 of 181, or 36%, had an idiopathic disor- 
der, consistent with the 33% found by New and Childrey 
(1932). The average duration was ~5 months, after which 
spontaneous recovery occurred, although the dysphonia 
averaged 1 month longer than the vocal fold paralysis. 
The left vocal fold was paralyzed in 43 patients (65.2%), 
the right in 19 (28.7%), and 4 (6.1%) had bilateral paraly- 
ses. Blau and Kapadia (1972) consider idiopathic uni- 
lateral vocal fold paralysis to be one of the cranial 
mononeuropathies: a benign condition in which individ- 
ual cranial nerve function is partially or totally impaired, 
the site of involvement undetermined and the etiology 
unknown, and which has a high rate of spontaneous 
recovery within a period of 9 months, as in the case of 
idiopathic tenth cranial nerve interference. 


Pathophysiology of Flaccid Dysphonia The basis of the 
abnormally breathy and weak voice of flaccid vocal fold 
paralysis is an increased volume velocity of airflow through 
the glottis and a failure of complete vocal fold vibration 
because of its flaccidity and improper position within the 
airway. The perceived breathiness is produced by air turbu- 
lence as air flows through the glottis, which remains open 
throughout the glottic cycle. Posteroanterior tomograms of 
normal and paralyzed vocal folds are shown in Fig. 5.5. In 


'As defined by Dorland’s Illustrated Medical Dictionary, a morbid state of 
unknown causation or of spontaneous origin. 


Figure 5.5 X-ray tomograms (posteroanterior) show- 
ing right vocal fold paralysis during quiet breathing, 
phonation, and effort closure. (A) Quiet breathing. The 
paralyzed vocal fold projects into the airway. The 
smoothly contoured side is the normal side. (B) Prolon- 
gation of the vowel /i/. The normal vocal fold adducts to 
meet the paralyzed one, giving a deceptively normal 
appearance, but there are asymmetric contours of the 
subglottic walls. (C) Bearing down producing effort clo- 
sure, illustrating the retained capacity for gross glottic 
closure activity. (From Ardran, G.M., Kemp, F.H., 
Marland, P.M. [1954]. Laryngeal palsy. Br J Radiol 27, 
201-209.) 


Fig. 5-5A, the tomogram is of a patient with right unilateral 
vocal fold paralysis in which the fold projects out into the 
glottis during quiet breathing. The following points are also 
notable. 


1. The vocal fold on the normal left side is turned upward, its 
free edge close to the lower surface of the ventricular fold. 
The laryngeal ventricle is barely visible. 


2. On the paralyzed side, the vocal fold projects across the 
glottis, its edge close to the midline. The laryngeal ventri- 
cle is wide and the ventricular fold is prominent and 
nearer to the midline than is the normal side. 


3. The lateral wall of the subglottic space on the normal side 
is flattened or slightly convex and on the paralyzed side it 
is concave. The airway is consequently narrowed, and its 
central axis is deviated toward the normal side. 


In Fig. 5.5B, the same patient is prolonging the vowel /i/. 
This illustration shows: 


+ The normal vocal fold on the left approximates the para- 
lyzed fold on the right and is aligned more horizontally 
with a thickened free edge. 


+ On the paralyzed side, the vocal fold is tilted slightly 
upward and its free edge is “tongue-shaped.” 


+ The laryngeal ventricle is larger on the paralyzed side. 


+ The glottal slit appears to be displaced toward the para- 
lyzed side. 


+ The ventricular fold appears to be more prominent on the 
paralyzed side. 


The tomogram of Fig. 5.5C is of the patient during effort 
closure while bearing down. Despite the paralysis, effort clo- 
sure is complete, demonstrating the compensatory ability of 
the sphincter-like closure mechanism, although in some 
patients the location of closure differs. 

Posteroanterior tomograms of a patient with bilateral 
vocal fold paralysis are shown in Fig. 5.6. The tomogram of 
Fig. 5.6A illustrates quiet breathing, showing the vocal folds 
resting toward the midline, creating a narrower than normal 
glottis. On prolongation of the vowel /i/ (Fig. 5.6B), there is 
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no demarcation between the true and false vocal folds, con- 
trary to what is expected in the normal larynx. The axis of 
the vocal tract is asymmetric, and there is a wide glottal 
channel even during voice production. The tomogram of Fig. 
5.6C shows the same patient bearing down, again illustrating 
the reserve sphincter power of the larynx despite a bilateral 
vocal fold paralysis and inadequacy of phonation. In fact, all 
15 patients examined by Ardran, Kemp, and Marland (1954), 
despite their dysphonias and aphonia, were able to adduct 
the vocal folds during gross sphincteric effort closure. 

Breathy voice quality and reduced loudness in unilateral 
and bilateral vocal fold paralysis, fewer syllables per exhala- 
tion, and more frequent inhalation during speech result from 
increased airflow through the glottis. 


Figure 5.6 X-ray tomograms (posteroanterior) show- 
ing bilateral vocal fold paralysis during quiet breathing, 
phonation, and effort closure. (A) Quiet breathing. Both 
vocal folds project into the airway. (B) Prolongation of 
the vowel /i/. Incomplete adduction of true vocal folds 
and lack of demarcation between true and false folds are 
seen. (C) Bearing down producing effort closure, illus- 
trating the retained capacity for gross glottic closure 
activity. (From Ardran, G.M., Kemp, F.H., Marland, P.M. 
[1954]. Laryngeal palsy. Br J Radiol 27, 201-209.) 





Hirano, Koike, and von Leden (1968) obtained maximum 
phonation times, mean flow rates, and phonation quotients 
of patients who had vocal fold paralyses (Table 5.2). In 10 of 
13 patients, maximum phonation time was shorter than 
normal, the product of excess expenditure of air. Mean flow 
rates in 8 of 13 patients were greater than normal, and in 7 
of 11 patients the phonation quotients were greater than 
normal. In 3 patients whose vocal folds had been injected 
with Teflon, clinical improvement was confirmed by an 
increase in maximum phonation time and a decrease in 
mean flow rate and phonation quotient. Ultraspeed motion 
picture studies of unilateral vocal fold paralyses show 
marked pitch and amplitude perturbations caused by the 
different rates of vibration of each vocal fold (Fig. 5.7). 


Table 5.2. Maximum Phonation Time, Mean Flow Rate, and Phonation Quotient in Cases of Vocal Cord Paralysis* 


Maximum Phonation 


Case Sex Age (Years) Time (Seconds) 
43 M 39 14.4 
44 M 45 3.9 
45 18.7 
45 M 59 5.8 
59 15.6 
46 M 32 6.8 
47 M 71 9.9 
71 24.1 
48 M 78 21.6 
49 M 42 36.0 
50 F 29 7.6 
51 F 40 7.4 
52 F 67 18.8 
53 F 69 12.8 
54 F 64 8.1 
55 E 47 9.5 
47 20.6 








Mean Flow Phonation Quotient 
Rate (cm?/s) (cm?]s) 

205 331 Left, paramedian 

382 1254 Left, intermediate, 

125 261 with atrophy. 
After Teflon 
injection 

484 559 Left, intermediate, 

201 208 with atrophy. 
After Teflon 
injection 

621 706 Left, intermediate, 
with atrophy 

270 310 Left, intermediate. 

117 128 After injection 

99 141 Left compensated 

53 126 Right compensated 

280 : Bilateral paramedian 

378 i Bilateral paramedian 

92 160 Bilateral paramedian 

105 189 Bilateral paramedian 

309 316 Bilateral paramedian 

315 346 Right, intermediate. 

145 160 Restricted 
abduction 


Source: From Hirano, M., Koike, Y., von Leden, H. (1968). Maximum phonation time and air usage during phonation. Folia Phoniatr 20, 158-201. 


*Respiratory tests inadequate. 
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Diplophonia, the simultaneous perception of two different 
pitches, is common in unilateral vocal fold paralysis. It is pre- 
sumably caused by each vocal fold vibrating at a different fre- 
quency, owing to differences in their tensions. Triplophonia, 
three pitches being heard simultaneously, has also occurred. 


Etiology of Vocal Fold Paralyses Producing Flaccid Dys- 
phonia Ina study of 633 cases of vocal fold paralysis, Hup- 
pler, Schmidt, Devine, et al. (1955) found that the largest num- 
ber, 272 patients (43%), was caused by thyroidectomy. The 
female to male ratio was 6:1, and in half the group, the vocal 
fold paralysis was bilateral. Treatment has changed since the 
1950s, and the percent of paralysis cases caused by thyroidec- 
tomy has decreased. Cancer of the breast, thyroid gland, lung, 
tongue, mouth, pharynx, or larynx, as well as other primary and 
metastatic cancers of the neck was the reason for surgery in 
these cases. Cardiovascular disease produced paralysis on the 
left side in all cases. The specific causes of heart disease were 
mitral stenosis, syphilitic aortitis and aneurysm, nonsyphilitic 
aneurysm, cardiomegaly, and congenital heart disease. Other 
causes of paralysis included specific neurologic diseases such as 
syringobulbia, poliomyelitis, and multiple sclerosis; trauma 
caused by automobile accidents, gunshot wounds, falls and 
blows to the neck, endotracheal intubation, bronchoscopy 
ingested foreign body; infection; diphtheria; granuloma and 
empyema. It is important to remember that 184 (29%) were 
idiopathic as previously discussed. This figure, too, has changed 
over the years because of improved diagnostic abilities. 

Vocal fold paralysis and flaccid dysphonia can occur as one 
of several signs in certain neurologic disease syndromes. In 
the jugular foramen syndrome, a lesion, usually a tumor, 
within or adjacent to the jugular foramen compresses cranial 
nerves IX, X, and XI, causing ipsilateral weakness of the pha- 
ryngeal, laryngeal, trapezius, and sternocleidomastoid mus- 


Figure 5.7 Frames from ultraspeed photography of 
unilateral vocal fold paralysis. Male larynx with paralysis 
of the left vocal fold. The sequence of photographs pro- 
gresses from left to right in the top row followed by the 
same order in the second row. The graph illustrates the 
motions of the approximate anteroposterior midpoints 
of the glottal borders and demonstrates the absence of 
glottal closure, the crossing of an arbitrary median sagit- 
tal plane by the healthy vocal fold reaches its medial 
limit, and vibrations of both folds at the same frequency. 
(From Moore, G.P. [1976]. Observations on laryngeal dis- 
ease, laryngeal behavior and voice. Ann Otol Rhinol 
Laryngol 85, 553-564.) 


cles. Another syndrome is caused by posterior inferior cere- 
bellar artery occlusion, producing infarction of the lateral 
medulla. Known as Wallenberg’s syndrome, it is marked by 
dysarthria and dysphagia, ipsilateral impairment of pain and 
temperature sensation on the face, and contralateral loss of 
pain and temperature in the trunk and extremities. Because 
the nucleus ambiguus undergoes destruction, unilateral 
vocal fold paralysis and flaccid dysphonia occur. In general, 
any vascular or space-occupying lesion within the posterior 
fossa is capable of damaging one or both tenth cranial nerve 
nuclei or their nerve trunks, causing unilateral or bilateral 
vocal fold paralysis and dysphonia. 


Special Considerations in the Diagnosis of Flaccid 
Dysphonia: Myasthenia Gravis 


1. Breathy voice and reduced loudness, even though the 
laryngologic examination is normal, ought to alert the cli- 
nician to the possibility of flaccid dysphonia due to lower 
motor neuron disease. 


2. Worsening of breathiness and weakness as a consequence 
of sustained effortful speaking ought to alert the clinician 
to the specific flaccid dysphonia of myasthenia gravis. The 
fewer pathologic changes seen laryngologically, the more 
important is clinical listening. Bilateral, minimal weakness 
of the vocal folds is the most common reason for the failure 
to identify flaccid dysphonia laryngoscopically and is the 
reason such patients are often misdiagnosed as having 
functional or psychogenic voice disorders. On videostrobo- 
scopic examination, the vocal folds generally appear intact, 
but the closed phase is extremely short, and aperiodicity 
may be increased. It is important to remember that flaccid 
dysphonia can be the sign of an encroaching neurologic 
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disease of the peripheral nervous system. The dysphonia 
of myasthenia gravis is a classic example of a frequently 
misdiagnosed voice sign. Most of the time, it produces a 
complete dysarthria involving phonation, resonation, and 
articulation and is not difficult to identify. (See Table 5.3 
for a complete description of the dysarthria.) The changes 
of identification become considerably poorer when only 
one of the components of the peripheral speech mecha- 
nism is affected; the larynx is such a component. Several 
cases of monosymptomatic dysarthria in myasthenia 
gravis have been reported, as for example hypernasality 
(Wolski, 1967). Myasthenia gravis focal to the larynx 
producing breathiness and weakness without accompa- 
nying hypernasality of articulatory defect has also been 
reported (Neiman, Mountjoy, and Allen, 1975). In an 
article on early motor unit disease misdiagnosed as 
psychogenic dysphonia, Aronson (1971) reported the fol- 
lowing case study. 


Case Study 5.1 


A 20-year-old secretary returned from vacation, and her 
parents noted that she had a weak voice. Two laryngologists 
and one internist diagnosed her voice change as “functional 
(nonorganic) dysphonia.” Yet psychiatric examination was 
unable to find a plausible psychologic explanation for her 
voice disorder. The following edited transcript of the intake 
interview shows the importance of history-taking in the 
diagnosis of voice disorders and how small clues can help 
the clinician think deductively about possible causes. 


Clinician: When did all this speech business begin? 

Patient: | was never one for speaking, because | was 
always quiet, but | suppose this summer I’ve noticed, uh, 
well, its been called to my attention that | slur words. 

C: What were you doing this summer when your voice 
disorder began? 

P: Oh, | don’t know. Like, I’d be sitting at the supper 
table and my words would seem all jumbled together. | 
think my parents were the first ones that called this to my 
attention. Ever since then I’ve caught myself. 

C: Were you feeling well otherwise? 

P: Yes. |had gone on vacation in May. | went down to the 
Bahamas about two weeks. 

C: When you came back from vacation, did you go right 
back to work? 

P: Yes, like, l’d be on the phone a lot. My voice, first my 
throat, like my words wouldn’t come out right, like they’d 
have to shove their way through. 

C: Did you have any funny feelings in your throat? 

P: Yeah, I’ve had difficulty swallowing. | mean, it doesn’t 
hurt, or anything. It’s just that it feels like a narrow pas- 
sage—either when | am talking or swallowing. 

C: Have you had any occasion to notice fluids coming 
through your nose as you swallowed? 

P: Wait, like when I’m drinking like when it seems to go 
right up my nose? 

C: Yes, how often? 

P: Quite a bit of the time when | drink. 


Perceptual 
Phonation 


Resonation 
Articulation 


Language 
Physical 
Larynx 


Velopharynx 


Tongue 
Lips 


Teeth 
Hard palate 
Mandible 


General Medical 


Other Neurologic 
Signs 
Peripheral 
nervous system 
Central 
nervous system 


Psychiatric] 
Psychologic 


Table5.3 Laryngeal-Phonatory Characteristics in the Flaccid 
Dysarthria of Myasthenia Gravis 


Findings 


Breathy voice quality, weak intensity. 
Deterioration of phonation during 
stressful counting or other prolonged 
speaking activities; reduced sharpness of 
cough after stressful speaking. Can exist 
in the absence of remaining signs of 
dysarthria 

Initially normal; hypernasality develops after 
stressful speaking 

Initially normal; articulatory imprecision 
develops after stressful speaking 

Normal 


In milder cases, vocal folds may appear 
normal in structure and function despite 
dysphonia; absence of positive 
laryngologic findings does not exclude 
presence of milder degrees of bilateral 
adductor weakness of vocal folds. In 
more severe cases, folds may fail to 
adduct and abduct completely, bilaterally. 
Bowing may be present. 

Initially normal; velopharyngeal 
insufficiency after stressful speaking. 
Hypoactive gag reflex 

Initially normal; tongue weakness after 
stressful speaking 

Initially normal; lip weakness after stressful 
speaking; lateral smile 

Normal 

Normal 

Initially normal; mandibular muscle 
weakness after stressful speaking 





Patient may complain of general fatigue, 
particularly after exercise. Findings can be 
nonspecific, leading to erroneous 
diagnosis of functional illness 


Weakness of bulbar musculature; positive 
Tensilon test 
Normal 


Patient may complain of fatigue, loss of 
energy, or loss of interest, which may be 
erroneously diagnosed as signs of primary 
depression. Patients may become 
secondarily depressed about lack of 
energy. 


C: And that is something new? 

P: Yes, it didn’t happen before this summer. And, my 
mouth was always really dry. You know, like when you 
wake up in the morning? It feels like that all day. 

C: Tell me, any emotional problems lately? Depressed at 
all? That kind of thing? 
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P: Hmm, nothing exciting’s happened, but | don’t think 
I’ve been really depressed. 

C: Your speech, you know, gives the impression of a cer- 
tain lack of vitality. Not very vivacious. Are you that type of 
person? 

P: No, when | was at college | was a cheerleader, and | 
was loud then. 

C: You were loud then! (C. surprised at this description 
because of their extremely flat, lethargic affect.) 

P: Yes, | mean, | could really belt it out, hours on end; 
but now, it’s like | get tired inside, or something. 

C: And that, again, has been true just since last sum- 
mer? 

P: Yes, | slowly noticed it. 

C: How do you get along with your family at home? 

P: Fine. 

C: No problems especially around June? How about 
your love life? Anything developing there? Any disap- 
pointments? 

P: | would like to get married next fall. 

C: And, how long have you been going with this person? 

P: About a year and a half. 

C: Has this been a satisfactory relationship? 

P: Uh huh. 

C: | want to be sure that there isn’t something that’s 
really disappointing you, or something that you’re keep- 
ing back, because if there is, we'd like to know about it. 

P: No. Truthfully, | can’t say that there is any one thing 
that I’ve been depressed about. 

C: When you say that there isn’t any one thing, does 
that mean there are many things? 

P: No (long pause) | don’t know, I get nervous fast. 

C: Tell me about that. 

P: There’s not much to say. | don’t tolerate much. | know 
| should, but | just, I’m short-tempered, and little things 
irritate me. 

C: Like what, for example? 

P: For example, like last week when | was here and we 
flew home and we flew over Chicago for an hour and a 
half, | was madder than a wet hen. And then they flew us 
back to Madison and we had to take the bus. [Laughs] | 
was just so nervous and uptight because of that. | was just 
so, | didn’t want to talk to anybody. 

C: Have you found this to be more apparent as of late? 
Or is this pretty much the way you’ve always been? 

P: Hmm, I’ve always been kind of short-fused. 

C: So this recent thing with the airline delay, your reac- 
tion didn’t surprise you, particularly. 

P: No, not really. 


Physical examination of her peripheral speech musculature 
failed to disclose any obvious asymmetries or weakness of the 
lips, tongue, mandible, or soft palate at rest or during volun- 
tary movement. Her general demeanor was introverted and 
expressionless, undoubtedly fostered by the low volume of 
her voice, its breathy quality, and the absence of pitch and 
inflection variations. Her cough was possibly reduced in 
explosive power, but one could not be sure. The test for fatigue 
of the motor speech mechanism in which the patient is asked 


to count rapidly and vigorously until told to stop revealed 
deterioration, not only of phonation in the direction of greater 
breathiness but also of palatopharyngeal and articulatory 
functions as well by the time the patient had reached the 
count of 90. Then number of syllables per exhalation began to 
drop because of air wastage at the glottis, and by the time the 
patient had reached 100, hypernasality and nasal emission 
had become apparent; one could feel vibration on the bridge 
of her nose during production of non-nasal phonemes. By 150, 
her hypernasality and nasal emission had become clearly 
audible, and by 225 her articulation was markedly indistinct, 
the patient physically unable to continue. 

The neurologic examination revealed mild but signifi- 
cant muscular weakness; there was slightly greater width 
of the left palpebral fissure than the right, mild facial 
weakness on the left, mild diplopia on lateral gaze, bilat- 
eral weakness of the soft palate, and on repetition of the 
counting test of endurance, a similarly high degree of 
speech deterioration. 

After speech deterioration due to prolonged counting, 0.2 
mL edrophonium chloride (Tensilon edrophonium chloride; 
ICN Pharmaceuticals Inc., Costa Mesa, CA) was administered 
intravenously. The patient had begun to count immediately 
after the injection, and ~30 seconds later, audible improve- 
ment in her speech was noted, and according to the patient, 
it was approaching normal. Electromyography provided evi- 
dence of defective neuromuscular transmission in all mus- 
cles tested with repetitive electrical stimulation, and the 
diagnosis of flaccid dysphonia, and later dysarthria, due to 
myasthenia gravis was made. 

A common clinical diagnostic trap is the interpretation of 
signs and symptoms as psychogenic because the patient has 
a history of psychiatric illness. That the complaint can be 
organic in spite of such a history is illustrated in the follow- 
ing lesson learned by Ball and Lloyd (1971). 

A slim, 16-year-old girl was admitted to the psychiatry 
ward after a series of arguments with her mother. She had 
been staying with an aunt who was a member of a religious 
sect in which “speaking in tongues” periodically occurred 
during times of religious ecstasy. Religion was continually 
held up to the patient as a way of salvation. 

One day during an unusually emotional religious service, 
when the congregation was praying for the patient, she sud- 
denly lost her voice. It returned spontaneously, but disap- 
peared again on a similar occasion. She also complained of 
feeling tired and lacking energy. A few months later, she 
developed difficulty in swallowing food, suffered aspiration 
pneumonia, and had to be hospitalized. Physical examina- 
tion, including neurologic studies, gave negative results, and 
the patient was placed under the care of a psychiatrist. It was 
his opinion that she was suffering from “hysteria,” and a sec- 
ond psychiatrist described her as having severe personality 
disturbance with schizoid features and tending toward psy- 
chotic episodes. 

In light of the previous negative physical findings, the dis- 
turbed family background, a schizophrenic mother, a pre- 
morbid history of shyness, and living with an aunt who 
placed great importance on “speaking in tongues,” a diagnos- 
tic formulation of hysterical aphonia, dysphagia, diplopia, 
and fatigue in a vulnerable personality would have been 
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understandable. Instead, the patient was closely watched. 
Fatigability became obvious and confirmed the suspicion 
that the patient had an organic disease. Subsequently, myas- 
thenia gravis was clearly demonstrated. After treatment 
with pyridostigmine (Mestinon, pryridostigmine bromide, 
USP; valeant Pharmaceuticals International, Atiso Viejo, CA) 
and thymectomy, her condition improved. She became 
attractive and outgoing, developed a wide range of interests, 
and began working normally as a typist. 

Equivocal and episodic physical signs in early myasthenia 
gravis are well-known and are often precipitated by emo- 
tional stress, accounting for the disease being commonly 
mistaken for a conversion disorder. This case illustrates the 
dangers of indiscriminate use of the “hysterical” diagnostic 
label, stresses the value of adequate neurologic training, and 
draws attention to the pitfalls resulting from reliance on pre- 
conceived ideas in diagnosis. 

Occasionally, conversion dysphonia can most certainly 
masquerade as myasthenia gravis. Certain patients, 
because of knowledge of the disease gained through read- 
ing or contact with others who have had it, develop, on a 
conversion reaction basis, voice signs similar to those 
found in myasthenia gravis. By use of a control such as 
intravenous saline (the patient believing it to be Tensilon), 
the physician can establish whether, in fact, myasthenia 
gravis is present. 


Case Study 5.2 


An 82-year-old former elementary school teacher was 
evaluated for severely breathy dysphonia which had 
been diagnosed elsewhere as myasthenia gravis and for 
which she had been taking pyridostigmine (Mestinon). 
She was of the opinion that her dysphonia signified that 
she was being improperly treated for her disease. The 
neurologist began injecting normal saline solution, 
the patient believing that he was using Tensilon. As she 
began to count, by the time she had gotten to 10 her 
voice had become perceptively louder, and by 50 it had 
returned to normal. After informing the patient that she 
did not have myasthenia gravis, we found that dyspho- 
nia was psychogenic and related to increasing irritation 
with her hard-of-hearing husband, with whom she 
found she could no longer communicate. Her voice dis- 
order was eventually psychiatrically diagnosed properly 
as a conversion disorder. 


Myasthenia gravis is a neuromuscular disease manifested 
by weakness and fatigability of striated muscles. Although 
its clinical features were isolated by the beginning of the 
20th century, it was much later that the mechanism of the 
weakness was identified: a reduction of available acetyl- 
choline receptors at the neuromuscular junction caused by 
autoimmune mechanisms. 

Muscle contraction is dependent upon stimulation of 
the motor end-plate by the neurochemical transmitter 
acetylcholine, which is synthesized in the motor nerve ter- 
minals and is stored in vesicles. Each vesicle, or “quantum,” 


contains ~10,000 acetylcholine molecules. When a nerve 
impulse is transmitted down the axon of the lower motor 
neuron, the acetylcholine is released at specific sites oppo- 
site acetylcholine receptors on the postsynaptic mem- 
branes. When the acetylcholine combines with its receptors, 
a transient increase in membrane permeability to sodium 
and potassium ions results in muscle depolarization. This 
process is extremely swift, occurring in milliseconds, and is 
terminated by removal of acetylcholine by diffusion away 
from the myoneural junction and by the counteraction of 
acetylcholinesterase, which hydrolyzes the acetylcholine. 
The amplitude of depolarization is directly related to the 
number of acetylcholine molecules that stimulate receptor 
molecules. Any reduction of acetylcholine receptor inter- 
actions below a safely margin will result in failure of neu- 
romuscular transmission, a principle necessary to the 
understanding of the decrement of muscle strength in 
myasthenia gravis. 


Spastic (Pseudobulbar) Dysphonia: Pyramidal 
Corticobulbar System Lesions 


The remaining neurologic dysphonias are caused by supranu- 
clear lesions, that is, lesions above the level of the tenth cra- 
nial nerve nucleus and within the CNS. Not much is known 
about normal CNS control over the tenth cranial nerve, 
although probably corticobulbar tracts synapse with vagus 
nerve nuclei in the same way as they do with the other cra- 
nial nerve nuclei. However, it is also likely that vagus nerve 
nuclei receive a disproportionately larger share of subcortical 
innervation to facilitate reflex laryngeal function, owing to its 
important life-sustaining responsibility. Such subcortical 
representation is substantiated by clinical and experimen- 
tal evidence that phonation is preserved despite congenital 
absence of cerebral tissue and massive surgical, traumatic, 
infectious, or degenerative destruction of the cerebral hemi- 
spheres. What is singular about CNS control over the final 
common pathway to the larynx is the apparent extensive 
neural redundancy of the system to ensure the organism’s 
survival, and the bilaterality of its laryngeal functions. The 
integration of the two sides of the larynx is such that under 
normal conditions, a discoordinated movement between the 
vocal folds has never been reported; what one side of the lar- 
ynx does the other does simultaneously and equally, as if 
both right and left halves of the larynx were under the control 
of a single CNS center—unlike the unilateral control of the 
extremities and the lower facial musculature. 

The tenth cranial nerve nuclei are under the influence of 
both pyramidal and extrapyramidal systems. Voluntary con- 
trol over the larynx is accomplished through the pyramidal 
system, sometimes referred to as the direct activating sys- 
tem, and neuroanatomically known as the corticobulbar 
tracts. Figure 5.8 schematizes the following. 


1. The corticobulbar tracts begin in the precentral gyri of 
both cerebral hemispheres. 


2. They descend as the corona radiata, become condensed in 
the internal capsule, and pass through the cerebral 
peduncles. 
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Figure 5.8 Pathways of corticobulbar tracts and their synapses with 
the tenth cranial nerve (vagus) nuclei. Ipsilateral as well as contralateral 
innervation explains why unilateral corticobulbar tract lesions do not 
produce unilateral vocal fold paralyses. 


3. At the upper border of the medulla, some of the fibers 
cross over (decussate) to the other side of the brain stem, 
where they synapse with vagal nuclei in the nucleus 
ambiguus. The remainder synapse with vagal nuclei on 
the same (ipsilateral) side. The implication of such bilat- 
eral corticobulbar innervation is that rarely, if ever, will a 
unilateral cortical or corticobulbar tract lesion cause 
unilateral vocal fold paralysis. Such is not the case when 
the cell bodies or axons of the corticobulbar tracts are 
damaged bilaterally. Then, a characteristic and often 
severe dysphonia called pseudobulbar (spastic)—not to be 
confused with spasmodic—dysphonia is heard, the voice 
having a harsh, strained-strangle quality, abnormally low 
pitch, monopitch, monoloudness, and reduced loudness. 
Rarely, however, does the dysphonia occur in the absence 
of its accompanying dysarthria. (See Table 5.4 for a sum- 
mary of signs found in spastic [pseudobulbar] dysarthria.) 
Botulinum toxin vocal fold injection was used for 
pseudobulbar dysphonia with some success; acoustic and 
aerodynamic results were comparable with those 
reported for individuals with spasmodic dysphonia. The 
most marked change was an increase in airflow. Despite 
persistent breathiness, the participant reported great sat- 
isfaction with the result because of more appropriate 
loudness and less listener burden because of a more 
relaxed and pleasant voice quality after injection 
(McHenry, Whatman, and Pou, 2002). Before injection, 
the strained-strangled, harsh voice quality is caused by 


Table 5.4 Laryngeal-Phonatory Characteristics in 
Pseudobulbar (Spastic) Dysarthria 


Findings 

Perceptual 

Phonation Hoarseness or harshness having a strained- 
strangled quality. Pitch is abnormally low. 
Monopitch. Loudness is reduced. 
Monoloudness. Almost never occurs 
without accompanying signs of 
dysarthria. Inappropriate crying or 
laughter may be present. 

Resonation Hypernasality 

Articulation Imprecise consonants; abnormally slow rate 

Language Normal, provided language areas are spared 

Physical 

Larynx Vocal folds appear normal in structure. 
Normal to hyperadduction of true and 
false vocal folds may occur bilaterally. 

Velopharynx Bilateral velopharyngeal insufficiency. 
Hyperactive gag reflex 

Tongue Topographically normal; may be smaller, 
more contracted than normal. Tongue 
weakness. Slow alternate motion rate 
(AMR) on lateral movements an on /tA/ 
and /kA/ syllable repetitions. 

Lips Weak, slow movements on AMR for /pA/ 

Teeth Normal 

Hard palate Normal 

Mandible Slow AMR 

General Medical Nonspecific 

Other 

Neurologic Signs 
Peripheral Normal 
nervous system 
Central nervous Signs of spasticity 


system 


Psychiatric] 
Psychologic 


Nonspecific. Pseudobulbar crying and 
laughter may give erroneous impression 
of emotional liability and intellectual 
deterioration. 


hyperadduction of the true and false vocal folds (i.e., glot- 
tic constriction and resistance to the exhalatory airflow). 


Pseudobulbar Crying and Laughing Of all the different 
sounds that humans produce, crying and laughing are 
unique reflex responses that originate from multiple 
sources in the CNS and that become disinhibited when 
damaged from neurologic disease. Pyramidal and extrapyra- 
midal tract lesions can produce compulsive crying and 
laughter because of a “release” of volitional control excited 
by an inborn mechanism that underlies these emotional 
responses. 

Included in these CNS regions are unilateral and bilateral 
corticobulbar tract lesions, globus pallidus, putamen, thala- 
mus, anterior regions of the internal capsule geniculate fas- 
cicle supranuclear pontine center, hypothalamus, floor of 
third ventricle, motor cortex, and temporal lobe. 
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Called emotional lability or emotional incontinence, these 
phenomena are most commonly observed in patients with 
the syndrome of pseudobulbar palsy and progressive 
supranuclear palsy that includes spasticity of the extremities 
and spastic dysarthria. 

Seriously incapacitating, these emotional responses need 
to be recognized as one of the great psychologic burdens 
borne by these patients. This emotional lability usually 
occurs in response to emotionally loaded conversation 
between patients and others. Usually, the triggering topic is 
touching, sad, or sentimental. The cry emerges suddenly 
without antecedent frowning or the graded smile before 
laughter, so that these responses give the impression of 
being stereotyped automatic and paroxysmal. The patient 
may switch back and forth between the two responses, but 
once it has begun, it is almost impossible to stop or arrest. 

The cry sometimes is associated with clinical depression 
secondary to the realization that the patient is suffering from 
a lethal illness that has a bleak future. Depressive crying is 
different from the compulsive neurologic crying described 
here; it is less abrupt, explosive, or precipitous. Aronson 
(1980) published an article describing his own emotional 
lability associated with a thromboembolic stroke that 
occurred during thoracic surgery. The embolus lodged in his 
right middle cerebral artery distribution including branches 
to his right internal capsule, producing a left spastic hemi- 
plegia and dysarthria producing deep depression and grief 
over his inability to walk or use his left upper extremity. 

During his recovery in the hospital, he would experience 
sudden uninhibited crying in reaction to almost all stimuli 
that were sad, touching, or sentimental. These episodes 
were embarrassing and disconcerting because of the ease 
with which they could be triggered and their intensity and 
uncontrollability. Music could easily precipitate the cry, 
especially violin music (he had played the violin since child- 
hood). Now he could not even hold the violin with his left 
hand. The very issue would bring forth explosive sobbing 
accompanied by copious tears. He wrote, “I knew that clini- 
cally I was depressed for the loss of a lifelong companion, 
my left side.” 

With the cry accompanying emotional liability, you feel 
as if you are in the clutches of some kind of seizure. Subjec- 
tively, the cry does not feel particularly different from other 
crying but is unique in its sudden unpredictability and 
inability to arrest once it has started. It begins with a feel- 
ing of constriction or contraction in the thoracic, abdomi- 
nal, laryngeal, and pharyngeal muscles then rolls into unin- 
hibited forced exhalation and a forced sustained phonation 
rising in pitch and loudness. Some clinicians have said that 
the cry and laughing lack justification or motivation. The 
emotional incontinence is not considered to be an affective 
disorder proper like those found in psychiatric practice. 


1. The degree of facial, thoracic, and abdominal muscle con- 
traction visibly and incongruously exceeds accompanying 
phonatory signs of crying, laughing, or tears. One acoustic 
characteristic of the cry is a long wail that rises in pitch, 
often reaching falsetto levels. At times the crying will be 
produced with an unvoiced air stream, aptly labeled mute 
grimacing. 


2. The patient may switch back and forth between crying 
and laughing without reason, giving the erroneous 
impression of emotional liability. 


Such patients are subject to misinterpretation as being not 
only emotionally unstable but demented as well, a presump- 
tion not supported by a study of 60 patients who had 
pseudobulbar palsy. In this study, it was found that when 
intelligence test scores were adjusted for age and slowness 
of response caused by the spasticity, the patients performed 
as well as normal controls. 

Lesions that produce pathologic laughter can be located in 
the corticobulbar tracts and beyond. Involuntary laughter 
has been reported in patients with tumors of the brain stem 
and hypothalamus, caused by either stimulation of the floor 
of the third ventricle or infarction of the temporal lobe 
(Achari and Colover, 1976; Haymaker and Kuhlenbeck, 1976). 
Other studies have implicated the internal capsule and sur- 
rounding basal ganglia. 

Pseudobulbar palsy can result from any lesion of the corti- 
cobulbar tracts bilaterally: vascular and degenerative dis- 
eases involving the motor cortical areas bilaterally, vascular 
diseases and tumors of the internal capsule or brain stem, 
degenerative diseases involving the entire corticobulbar 
tract system, and infectious diseases. Unilateral lesions can 
have the same result. 


Mixed Flaccid-Spastic (Pseudobulbar) Dysphonia: Tenth 
Cranial Nerve and Pyramidal Tract Lesions 


Dysphonias in mixed dysarthria produce the voice signs of 
each dysarthric component simultaneously. A common 
example is the mixed dysarthria in flaccid-spastic paralysis 
as exemplified by ALS. A degenerative disease of bilateral 
corticobulbar tracts and lower motor neuron nuclei, spastic 
and flaccid paralysis and associated dysarthrias occur 
simultaneously, although the proportion of one over the 
other varies from patient to patient and within a given 
patient, depending upon the stage of the disease. Should 
the paralysis be predominately spastic, hyperadduction of 
the true and false vocal folds will be seen, and the effect 
will be a primarily spastic or pseudobulbar dysphonia. If 
the disease expresses itself primarily as flaccid weakness, 
hypoadduction of one or both vocal folds will be seen on 
laryngologic examination, and the dysphonia will be pre- 
dominately flaccid. Pooling of saliva in the pyriform sinuses 
is common. 

The dysphonia of mixed, flaccid-spastic dysarthria is typi- 
cally a harsh, strained-strangled sound with degrees of 
breathiness, reduced loudness, audible inhalation, and “wet 
hoarseness.” The “wet” or gurgly sounding voice is due to the 
accumulation of saliva in the pyriform sinuses and on the 
vocal folds resulting from the reduced frequency of swallow- 
ing. Rapid tremor or flutter on vowel prolongation, also 
noted in pure flaccid dysphonia, is common. Although the 
frequency of the flaccid “flutter” is yet to be quantified by 
research, Aronson and colleagues at the Mayo Clinic esti- 
mated that the range is from 9 to 12 Hz. Figure 5.9 is a com- 
parison between the vowel prolongation of a normal subject 
and that of a patient with ALS. Note the considerable 
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Figure 5.9 Sonogram of a normal subject’s 
vowel prolongation (top) and that of a patient 
with amyotrophic lateral sclerosis (bottom). 
Note the relatively smooth tracing of the 
normal subject compared with the rapid 
undulations indicating the flaccid “flutter” of 
the patient with amyotrophic lateral sclerosis. 
(50 mm = 1 second.) 
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difference between the rapid undulations in the vowel pro- 
longation of the patient with ALS and the relatively smooth 
output of the normal subject. Table 5.5 is a summary of the 
mixed dysarthria of ALS. 

Carrow, Rivera, Mauldin, et al. (1974) analyzed and rated 
the dysarthrias of seven patients, aged 20 to 65 years, who 
had motor neuron disease, including ALS. Speech samples 
were rated on a 1 to 7 scale of severity, 1 being normal and 7 
severe (Table 5.6). Approximately 80% had harsh voice qual- 
ity, 75% hypernasality, 65% breathy voice quality, 63% voice 
tremor, 60% strained-strangled quality, 41% audible inhala- 
tion, 38% excessively high pitch, and 8% excessively low 
pitch. It is apparent that dysphonia is a prominent and inca- 
pacitating characteristic of motor neuron disease. 

The terms motor neuron disease and ALS are often used 
interchangeably, although it is preferable to use the term ALS 
only when pyramidal tract signs predominate. In patients in 
whom lower motor neuron signs are the most prominent, 
the term progressive muscular atrophy (PMA) is used. The 
peak incidence of motor neuron disease is in the fifth to sev- 
enth decades of life, with ALS occurring slightly later than 
PMA. The male to female sex ratio in ALS is 3:2, and in PMA 
it is 5:1. The disease presents with brain-stem (i.e., bulbar) 
palsy in 30% of cases and progresses more rapidly than the 
spinal form, with death usually occurring within 3 years. 
Fasciculations are seen more often in ALS. Speech distur- 
bance is considered to be the earliest sign in the bulbar type, 
according to Rose (1977), who states that the bilabial plo- 
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sives are affected first owing to facial muscle weakness, the 
patient initially noticing difficulty in whistling. Palatal weak- 
ness causing hypernasality and nasal emission is usually, but 
not always, symmetric. The mandible droops, particularly in 
the pseudobulbar variety. The tongue is flaccid and atrophic, 
and it fasciculates. 

The etiology of ALS is unknown. Theories include viral 
infection, the effect of malignancy, and in the familial form a 
genetic deficit of neuronal enzymes. An endemic form has 
been described in Guam among the Chamorro people, where 
the incidence is the highest in the world (57%). The most 
common causes of death are respiratory failure, pneumonia, 
and pulmonary infarction. 


Hypokinetic (Parkinsonian) Dysphonia: 
Basal Ganglia Lesions 


The basal ganglia have both subcortical and cortical connec- 
tions. This portion of the extrapyramidal system modulates 
nerve impulses to lower motor neurons. Damage to the sub- 
stantia nigra in the basal ganglia releases inhibition of nerve 
impulses to the lower motor neuron, causing rigidity and 
slowness of movements. When the speech muscles are 
affected, the dysarthria of parkinsonism (i.e., hypokinetic 
dysarthria) occurs (Table 5.7). 

The dysphonia of parkinsonism consists of reduced loud- 
ness and monopitch (Darley, Aronson, and Brown, 1969a,b). 
In a study of 200 parkinsonian patients undertaken by 
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Table5.5 Laryngeal-Phonatory Characteristics in the Mixed 
Dysarthria of Amyotrophic Lateral Sclerosis* 








Findings 

Perceptual 

Phonation Hoarseness or harshness having a strained- 
strangled quality; “wet” or “gurgly” 
component. Rapid tremor or “flutter” on 
vowel prolongation. Breathy if strong 
flaccid component; pitch is abnormally 
low. Monopitch. Loudness is reduced. 
Monoloudness. Inhalatory stridor if 
severe. Reduced sharpness of cough. 
Inappropriate crying or laughter may be 
present. 

Resonation Hypernasality; nasal emission 

Articulation Imprecise consonants; abnormally slow rate 

Language Normal 

Physical 

Larynx Vocal folds appear normal in structure. If 
major component is spastic, vocal folds 
appear to adduct normally or may 
hyperadduct, along with false vocal 
folds. Adduction may be bilaterally 
symmetric, or one vocal fold may adduct 
less fully than the other. If there is a 
major flaccid component, vocal folds 
may adduct and abduct with less than 
normal excursions. 

Velopharynx Bilateral velopharyngeal insufficiency, 
possibly asymmetric. Hyperactive gag 
reflex 

Tongue Topographically normal; furrowed and 
reduced in size owing to atrophy. 
Fasciculations. Weakness. Slow alternate 
motion rate (AMR) on lateral movements 
and on /tA/ and /kA/ syllable repetitions 

Lips Weak, slow movements on AMR for /pA/ 

Teeth Normal 

Hard palate Normal 

Mandible Slow AMR 

General Medical Nonspecific 

Other Neurologic 

Signs 
Peripheral Signs of flaccid paralysis 
nervous system 
Central nervous Signs of spasticity 
system 
Psychiatric/ Nonspecific. Pseudobulbar crying and 
Psychologic laughter may give erroneous impression 


of emotional lability and intellectual 
deterioration 


*Basically the same as in any other nervous system disease affecting both 
pyramidal and lower motor neuron tracts bilaterally. 


Logemann, Fisher, Boshes, et al. (1978), 178 (89%) had signs of 
laryngeal dysfunction; 30 (15%), had breathy voice quality; 58 
(29%) had roughness; 90 (45%) had hoarseness; and 27 (13.5%) 
had tremulousness of voice. The characteristically mono- 
pitched voice arises from the rigidity and reduced range of 
motion of the intrinsic and extrinsic laryngeal muscles. Pitch 
change requires that the strap muscles elevate and lower the 


Table 5.6 Abnormal Voice Dimensions in 79 Patients with 
Motor Neuron Disease (ALS) Exhibiting Abnormal Voice 


Characteristics Dimension Affected Percentage (%) 


Harsh voice 79.95 
Hypernasality 74.68 
Breathy voice 64.56 
Voice tremor 63.29 
Strained-strangled voice 59.49 
Audible inspiration 40.51 
High pitch 37.97 
Low pitch 7.59 


Source: From Carrow, E., Rivera, V., Mauldin, M., Shamblin, L. (1974). Deviant 
speech characteristics in motor neuron disease. Arch Otolaryngol 100, 212-218. 


larynx, that the vocal folds be stretched and loosened by 
cricothyroid muscle contraction and relaxation, and that 
infraglottal air pressure be increased and reduced. If the laryn- 
geal and respiratory musculature is rigid, as in parkinsonism, 
range of motion will be diminished. Controlled changes of 
infraglottal air pressure, necessary for the stress and emphasis 
of normal prosody, appear to be lacking in parkinsonian 
patients, whose respirations are shallow owing to the reduced 
range of motion of thoracic and abdominal muscle, and whose 
discrete chest pulses are diminished or absent. The loss of 
variation in emphasis and stress in contextual speech is what 
gives the overall effect of flattening of prosody. 

Clinicians need to be alert to reduced loudness and breathy 
voice quality as a sign of early hypokinetic dysarthria, even 
though remaining signs of the dysarthria and nonspeech 
signs of parkinsonism are not apparent. Such patients are fre- 
quently misdiagnosed as having “functional” dysphonia. 


Case Study 5.3 


A year before coming to the clinic, a 64-year-old minister 
had acoronary attack. He had been working exceptionally 
hard and had been under much stress trying to minister to 
a parish with insufficient funds. During the year after this 
attack, he noticed a diminution in the loudness of his 
voice and described it as “muffled.” He revealed that, in 
his opinion, he might also have had a slight “stroke,” 
because his left side seemed weak and his gait unsteady. 

Laryngoscopic examination detected little in the way of 
laryngeal pathologic changes, the only finding being 
slightly greater than normal spaces in the interarytenoid 
region of the glottis during phonation. A diagnosis of 
functional voice disorder was made. 

On closer scrutiny, however, the patient’s alternate 
motion rates on /p%/, [t^], and /k*/ appeared slightly 
accelerated and were produced with a less than full range 
of motion. The patient admitted to a slowing of his think- 
ing during the year and a change in his handwriting. 

Neurologic examination confirmed the presence of early 
parkinsonism. Although no single sign was convincing, when 
the speech, mild masking of facial expression, reduced arm 
swing, and cogwheeling of extremities were assembled, 
they argued strongly in favor of this neurologic diagnosis. 
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Table 5.7 Laryngeal-Phonatory Characteristics in the 
Hypokinetic Dysarthria of Parkinsonism 


Findings 


Perceptual 
Phonation Monopitch; reduced stress; monoloudness; 
reduced loudness; harsh voice quality; 

breathy voice quality 
Note: Reduced 

loudness and 

breathiness in the 

absence of other 

neurologic signs 

can indicate early 


parkinsonism. 
Resonation 
Articulation 


Language 


Physical 
Larynx 


Velopharynx 
Tongue 


Lips 


Teeth 
Hard palate 
Mandible 


General Medical 


Other Neurologic 
Signs 
Peripheral 
nervous system 
Central 
nervous system 


Psychiatric] 
Psychologic 


Normal 

Imprecise consonants; short rushes of 
speech; accelerated rate: stuttering-like 
repetitions of syllables, words, or phrases 
(palilalia) 

Usually normal. Language functions may 
be decreased as part of overall slowing of 
general intellectual processes. 


Vocal folds appear normal in structure. 
Adductor, abductor movements are 
bilaterally symmetric, but there may be 
incomplete closure of vocal folds, 
accounting for breathy voice quality. 

Normal 

Topographically normal; alternate motion 
rates (AMRs) for /tA/ and /kA/ sound 
rapid and are reduced in amplitude of 
movement 

AMRS for /pA/ sound rapid and are reduced 

in amplitude of movement 

ormal 

ormal 

Reduced range of motion during 

articulation 


N 
N 
Nonspecific 


Normal 





WN 


igns of hypokinesia elsewhere in the body 
Note: Hypokinetic dysarthria in the form 
of dysphonia only can be the first sign of 
early parkinsonism. 

Nonspecific. Masked facies may give the 


erroneous impression of flatness of affect 
or of depression. 


Motion picture studies of the vocal folds during phonation 
in 30 patients with Parkinson’s disease disclosed a direct 
relationship between breathiness, and reduced loudness, 
and increasing amounts of glottic gap and bowing of the 
vocal folds (Hanson, Gerratt, and Ward, 1984). 

Physical measurements of respiration in hypokinetic 
dysarthria prove that such patients have shallow irregular 
breathing, pause between respirations, and experience peri- 


ods of increased depth of respiration that alternate with 
periods of respiratory arrest. Postencephalitic parkinsonism 
patients have shallow respiration of twice the normal fre- 
quency; negligible differences between vegetative, deep, and 
speech breathing; smaller than normal vital capacities; air 
wastage before speaking; and exhalations repeatedly inter- 
rupted by small inhalations (Darley, Aronson, and Brown, 
1975). 

The results of studies of frequency and intensity of parkin- 
sonian patients are equivocal. Canter (1963) found that 
parkinsonism patients read orally at a median fundamental 
frequency of 129 Hz, which was statistically significantly 
higher than the median frequency of 102 Hz found in a con- 
trol group of normal subjects. A similar increase in funda- 
mental frequency was found by Kammermeier (1969), but 
Schilling (cited by Darley, Aronson, and Brown, 1975), on the 
other hand, found pitch range to be considerably decreased. 
The results of studies of vocal intensity, or loudness, in 
parkinsonism have demonstrated that the lower-intensity 
levels exhibited by these patients can be significantly ele- 
vated with intensive voice treatment (Lee Silverman voice 
treatment LSVT) and maintained at the louder level. On aver- 
age, patients treated with LSVT raised their vocal intensity 
level for reading 9 dB SPL and that intensity decreased by 
only 3 dB SPL (sound pressure level) when tested 2 years 
after treatment was terminated (Ramig, Sapir, Countryman, 
et al., 2001). 


Arrhythmically Fluctuating, Neurologic 
Voice Disorders 


Ataxic Dysphonia: Cerebellar Lesions 


The cerebellum coordinates muscles and regulates skilled 
movements. Contained within the skull’s posterior fossa, the 
cerebellum consists of an anterior lobe, a posterior lobe, and 
a flocculonodular lobe. It connects with the brain stem and 
structures above and below it by means of incoming and out- 
going fibers that pass through the inferior, middle, and supe- 
rior cerebellar peduncles to brain-stem nuclei, the dentate 
nucleus, the thalamus, and the cerebral cortex. Of the many 
functions of the cerebellum, including equilibrium, posture, 
and gait, the posterior lobes are particularly concerned with 
voluntary movements. Lesions in this area result in a loss of 
muscle coordination, known as dyssynergia; a loss of the 
ability to measure range of motion, known as dysmetria; and 
tremor during voluntary movement, known as intention 
tremor. General movements are clumsy and uncoordinated 
(Daube, Sandok, Reagan, et al., 1978). 

Whether or not dysphonia is present may depend upon 
the severity of the ataxia. The dysphonia may take one of 
several forms: sudden bursts of loudness, irregular increases 
in pitch and loudness, or coarse voice tremor (Table 5.8). 


Hyperkinetic (Choreic) Dysphonia: Basal Ganglia Lesions 


Quick, jerky, irregular, and unpredictable movements caused 
by lesions of the basal ganglia, probably the caudate nucleus, 
are found in Sydenham’s chorea in children and Huntington’s 
chorea, the hereditary type, in adults. In addition to the 
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Table 5.8 Laryngeal-Phonatory Characteristics 
in Ataxic Dysarthria 


Findings 

Perceptual 

Phonation Frequently normal. Others have harsh 
voice quality, monopitch, 
monoloudness, excess, and equal stress 
on ordinarily unstressed words or 
syllables, excess loudness, bursts of 
loudness, and coarse voice tremor. 

Resonation Normal 

Articulation Imprecise consonants; irregular 
articulatory breakdown; distorted 
vowels; slow rate 

Language Normal 

Physical 

Larynx Vocal folds appear normal in structure and 
function 

Velopharynx Normal 

Tongue Topographically normal; irregular and slow 
alternate motion rates (AMRs) on /tA/ 
and /kA/ and on all lateral tongue 
movements 

Lips Irregular and slow AMRs for /pA/ 

Teeth Normal 

Hard palate Normal 

Mandible Normal 


General Medical Nonspecific. Note: Ataxic dysarthria may be 
a sign of moderate to severe hypo- 


thyroidism and drug and alcohol abuse. 
Other Neurologic 


Signs 

Peripheral nervous Normal 
system 

Central nervous Signs of ataxia 
system 


Psychiatric] 
Psychologic 


Nonspecific 


sudden uncontrolled movements of the lips, tongue, and 
mandible, movements of the respiratory and laryngeal mus- 
culature produce irregular pitch fluctuations and voice 
arrests, giving speech a jerky quality. In a study done by Dar- 
ley, Aronson, and Brown (1969a,b), the following phonatory 
aberrations were found: sudden forced inspiration or expira- 
tion; harsh voice quality; excess loudness variations; 
strained-strangled phonation; monopitch; monoloudness; 
reduced stress; transient breathiness; and voice arrests. 
Table 5.9 summarizes the hyperkinetic dysarthria of chorea. 


Hyperkinetic (Dystonic) Dysphonia: Basal Ganglia Lesions 


A slower form of hyperkinesia known as dyskinesia or dysto- 
nia is manifested by repetitive, slow, twisting, writhing, or 
flexing movements of the musculature. When only the 
speech musculature is affected, the terms orofacial dyskine- 
sia or focal mouth dystonia are employed. In addition to vas- 
cular, degenerative, or mass lesions, this movement disorder 
can result from prolonged use of psychotropic and 
antiparkinsonian drugs. In cases caused by drug therapy, the 


Table 5.9 Laryngeal-Phonatory Characteristics in the 
Hyperkinetic Dysarthria of Chorea 








Findings 
Perceptual 
Phonation Intermittently harsh, strained-strangled 
voice quality; transient breathiness; 
distorted vowels; monopitch; excess 
loudness variations; monoloudness; 
excess, equal stress on ordinarily 
unstressed words or syllables; reduced 
stress; sudden forced inspiration/ 
expiration 
Resonation Intermittent hypernasality 
Articulation Imprecise consonants; distorted vowels, 
prolonged intervals between syllables 
and words; variable rate; inappropriate 
silences; prolonged phonemes; short 
phrases; irregular articulatory 
breakdown 
Language Normal; defective in patients who have 
undergone intellectual deterioration 
Physical 
Larynx Vocal folds appear normal in structure, 
intermittent hyperadduction 
Velopharynx Normal in appearance 
Tongue Topographically normal; quick 
unpatterned movements at rest and 
during alternate motion rates (AMRs) for 
[tA] and /kA/, which are irregular 
Lips Quick and unpatterned movements at rest 
and during AMRs for /pA/ 
Teeth Normal 
Hard palate Normal 
Mandible Quick asymmetric movements at rest and 
during speech 
General Medical Nonspecific 
Other Neurologic 
Signs 
Peripheral nervous Normal 
system 
Central nervous Choreic movements elsewhere in the body 
system 


Psychiatric} 
Psychologic 


Intellectual and behavior changes 
associated with dementia, if present 


dyskinesia does not appear until after prolonged use, and so 
the term tardive dyskinesia is used in reference to the late 
(tardy) appearance of the dyskinesia (Portnoy, 1979). 

In addition to rounding, pursing, protruding, and lateraliz- 
ing lip movements and tongue protrusions, rotation, and lat- 
eralization, similarly uncontrolled adductor and abductor 
laryngeal spasms cause strained hoarseness and breathiness, 
and sometimes paroxysmal inhalatory stridor. A form of dys- 
tonia involving the entire body is dystonia musculorum 
deformans. This disorder can produce profound laryngeal 
spasms that cause a highly strained or groaning dysphonia of 
a waxing and waning character. 

Athetosis, also a slow hyperkinesia that is commonly seen 
as one form of cerebral palsy, produces similar strained 
hoarseness and breathiness in addition to the writhing 
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movements of the head, neck, torso, and extremities. In a 
study done by Darley, Aronson, and Brown (1969a,b), 
patients who had dystonia gave evidence of harsh, strained- 
strangled voice; excess loudness variations; voice arrests; 
short phrases; monopitch, monoloudness; reduced stress; 
inappropriate silences; and excess and equal stress on all syl- 
lables of words. Table 5.10 summarizes the hyperkinetic 
dysarthria of dystonia. 


Rhythmically Fluctuating, Neurologic 
Voice Disorders 


Organic (Essential) Voice Tremor 


Voice tremor is a component of the essential tremor syndrome, 
called benign heredofamilial tremor when familial. The voice 
tremor can be monosymptomatic without evidence of tremor 


Table5.10 Laryngeal-Phonatory Characteristics in the 
Hyperkinetic Dysarthria of Dystonia 


Findings 

Perceptual 

Phonation Slow, continuous changes in strained- 
hoarse quality; breathiness; excess; 
loudness variations; voice arrests; 
monopitch; monoloudness; reduced 
stress; excess and equal stress on 
ordinarily unstressed syllables and words 

Resonation Normal 

Articulation Imprecise consonants; distorted vowels, 
short phrases; inappropriate silences 

Language Normal. May be defective if dysarthria is 
associated with focal language disorders 
of diffuse intellectual disorders 

Physical 

Larynx Vocal folds appear normal in structure and 
function; intermittent hyperadduction 

Velopharynx Normal 

Tongue Topographically normal; slow, unpatterned 
protrusive, lateral, and rotatory 
movements at rest and during speech. 
Alternate motion rates (AMRs) on /tA/ 
and /kA/ are slow and highly irregular. 

Lips Slow unpatterned lip rounding and 
spreading. AMRs for /pA/ are slow and 
highly irregular. 

Teeth Normal 

Hard palate Normal 

Mandible Slow unpatterned depression, 


General Medical 


Other Neurologic 
Signs 

Peripheral nervous 
system 

Central nervous 
system 

Psychiatric] 
Psychologic 


lateralization, and elevation 
Nonspecific 





Normal 


Signs of dystonia may be confined to larynx or 


may occur elsewhere in the body 


Nonspecific. Intellectual and behavioral 


aberrations present if diffuse central nurvous 
system disease is present 


elsewhere in the body or can accompany the full syndrome of 
tremor of the upper limbs, head, face, or neck musculature. The 
rate of the tremor ranges most often from 4 to 7 Hz, and this 
tremor must be distinguished from those associated with 
parkinsonism, cerebellar disease, thyrotoxicosis, and anxiety. 
Normal humans have small tremor amplitudes of 6 to 12 Hz at 
rest and during intentional movements (Brumlik, 1962). Nor- 
mal physiologic tremor has been reported in children at 5 or 
6 Hz and in young adults at 10 Hz, the change in rate taking 
place during puberty (Marshall, 1962). After about age 40 
years, the tremor begins to decline from 10 to 6 Hz. 

No uniform location of the lesion in the nervous system 
has been found in organic voice tremor. In his wide-sweep- 
ing review of the literature, Critchley (1949) documented 
many brain-stem lesions causing tremor; caudate nucleus, 
putamen; loss of cells in the cerebellum and dentate nuclei; 
and loss of cells within the triangle connecting the red 
nucleus, dentate nucleus, and inferior olive. Essential tremor 
can produce three kinds of effects on voice: 


1. If the alternating contractions of the adductors and 
abductors of the vocal folds are of equal strength, typical 
organic voice tremor will occur. 


2. If the adductor movements are disproportionately 
stronger than abductor, the vocal folds will meet momen- 
tarily in the midline causing voice arrests, producing 
adductor spasmodic dysphonia of essential tremor (see 
Chapter 6). 


3. If the abductor movements are disproportionately 
stronger than adductor, the vocal folds will overabduct 
causing breathy air release, producing abductor spas- 
modic dysphonia of essential tremor (see Chapter 6). 


The onset of organic voice tremor can be gradual or sud- 
den. Brown and Simonson (1963) found that 5 of 27 patients 
had rapid onset of voice tremor: one while reciting the Lord’s 
Prayer, a second while singing in church, and a third while 
talking at a board meeting. The other two noted abrupt onset 
of hoarseness and a strained feeling in the throat while 
talking, and, as these symptoms cleared, the voice tremor 
appeared. That these patients were older than the age of 50 
years led the researchers to conclude that the rapid onset 
was due to occlusive vascular disease. 

The voice of an individual with an organic tremor has a 
characteristically quavering intonation because of tremor of 
the laryngeal muscles and often of the articulatory and res- 
piratory muscles as well. Severe cases have a staccato 
abruptness of voice heard as rhythmic voice arrests. The 
tongue also moves in a rhythmic tremor both at rest and on 
protrusion. 

Clinicians should remember that voice tremor can occur 
without associated tremors elsewhere in the body. Brown 
and Simonson (1963) found voice tremor to be the only 
tremor in 6 of 23 patients. For the entire group, the voice 
tremor ranged from 4 to 8 Hz, although in 16 to 23 cases, the 
tremor frequency was concentrated in the 5- to 6-Hz range. 

The voice tremor in organic tremor is best demonstrated 
during vowel prolongation. The tremor can be smooth or 
interrupted by the previously noted voice stoppages or vocal 
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arrests. These voice arrests are caused by momentary 
obstruction of the exhaled air stream due to complete glottic 
closure simultaneous with vertical movements of the larynx. 

A mild organic voice tremor is often masked by contextual 
speech. As Brown and Simonson noted in their study of 31 
patients with this disorder, short-duration vowels were 
uttered without perceptible impairment, but the tremor 
became obvious during production of sounds of longer dura- 
tion. It is of clinical importance to know that the severity of 
the tremor is greater during emotional stress and fatigue. 
Patients show little in the way of psychoneuroses, but many 
cases have occurred after acute emotional stress. Table 5.11 
is a summary of the dysarthria of organic voice tremor. 


Palatopharyngolaryngeal Myoclonus 


Palatopharyngolaryngeal myoclonus refers to rhythmic or 
semirhythmic movements of the soft palate, pharyngeal 
walls, laryngeal musculature, eyeballs, diaphragm, and 
tongue. Technically, it is a slow form of tremor; myoclonus 
may be a misnomer. These signs may exist in isolation or in 
various combinations. When the soft palate is involved, 
abrupt, rhythmic, anteroposterior, and vertical movements 
are present during speech and at rest. Pharyngeal muscle 
contractions can open and shut the eustachian tube, produc- 
ing a bruit or clicking sound transmitted by the tympanic 
membrane and sometimes heard by others at a fair distance 
at the same rate and rhythm of the myoclonic contractions. 


Case Study 5.4 


A 51-year-old male came to the clinic because of spas- 
modic blinking of his eyelids and right arm clumsiness 
after attacks of influenza. He noted a clicking sensation in 
his larynx that seemed to obstruct his breathing, and he 
complained of episodes of aphonia, breathy dysphonia, 
and a sensation of spasm in the larynx. At times he would 
gasp for air on inhalation. He had frequent contractions of 
the orbicularis oculi. He had slight bilateral sensory and 
motor paralysis of the pharyngeal muscles. Also, although 
he had some adductor weakness of the vocal folds, the 
outstanding feature noted on laryngoscopic examination 
was constant, involuntary, rhythmic bilateral adductor 
movements of the vocal folds that occurred 60 to 80 
times per minute and frequently interfered with inspira- 
tion by producing complete adduction before inhalation 
had been completed. He also had myoclonic twitching of 
the facial muscles synchronous with movements of the 
vocal folds. The pharyngeal muscles and soft palate did 
not participate in these movements. The disorder was 
diagnosed as postencephalitic myoclonic movements of 
the larynx and pharynx (Childrey and Parker, 1931). 


Rhythmic adductor movements of the vocal folds, and 
gross upward and downward movements of the larynx, 
cause momentary, rhythmic, phonatory interruptions. How- 
ever, because each myoclonic movement is brief, voice inter- 
ruptions are rarely audible during contextual speech though 
they can be easily seen on kymographic and high-speed 


Table 5.11 Laryngeal-Phonatory Characteristics in the 
Hyperkinetic Dysarthria of Organic (Essential) Tremor 


Findings 


Perceptual 
Phonation Quavering or intermittent voice arrests 
during contextual speech. Rhythmic 
tremor and/or voice arrests on vowel 
prolongation ranging from ~4 to 7 Hz. 
Note: In severe organic voice tremor, 
the voice arrests take the form of severe 
larygospasm that may be mistaken for 
the syndrome of spastic (spasmodic) 
dysphonia. Patients with voice arrest 
will show a smoothing out of these 
arrests into ordinary tremor when 
sustaining a vowel at a high pitch 
level. Fluctuating strained-hoarseness 
along with tremor in more severe 
cases 
Normal 
Normal. May have irregular articulatory 
breakdowns reminiscent of ataxic 
dysarthria 
Normal 


Resonation 
Articulation 


Language 

Physical 

Larynx Vocal folds appear normal in structure. On 
vowel prolongation, adductor-abductor 
oscillations synchronous with voice 
tremor can be seen as well as 
pharyngeal wall movements. Tremor 
movements of the larynx can be seen 
under skin of neck, with the larynx 
oscillating vertically. Voice arrests 
occur at the maximum laryngeal height 
of each oscillation. 

Normal. Soft palate may move 
synchronously with laryngeal tremor 

Topographically normal; on vowel 
prolongation, tremor movements of 
tongue may be seen synchronously 
with laryngeal tremor. 

Lips Normal, may be tremorous 

Teeth Normal 

Hard palate Normal 

Mandible Normal, may be tremorous 


General Medical 


Velopharynx 





Tongue 


Nonspecific 


Other Neurologic 
Signs 
Peripheral nervous 
system 
Central nervous Head and hand tremor, unilateral or 
system bilateral, may be present 
Note: Organic voice tremor may occur 
as an isolated sign, which is prone to 
misinterpretation as being 
psychogenic. 





Normal 


Psychiatric] 
Psychologic 


Many patients report onset of voice, head, 
or hand tremor after an emotionally 
stressful life event. There is a danger of 
interpreting such events as proof that 
the tremor is psychogenic. 
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images. They become perceptually apparent, however, during 
vowel prolongation, when one hears momentary voice arrests 
ranging among patients from 60 to 240 beats per minute. In 
rare instances, the laryngeal myoclonus can be so severe that 
abrupt, strained voice arrests are produced giving the impres- 
sion of adductor spasmodic dysphonia. 

It is easy to miss this disorder by failing to test for it by 
means of vowel prolongation, because the voice arrests are 
usually masked by contextual speech. Also, oral examination 
of the pharyngeal and palatal musculature at rest is impera- 
tive to detect the myoclonic movements. Scrutiny of the exter- 
nal surfaces of the neck is important to note subcutaneous 
laryngeal movements. Subjective signs of the syndrome are a 
clicking sensation in the larynx and a sensation of laryngeal 
spasm. Myoclonic movements can be unilateral or bilateral. 

Although the syndrome is found in people of all ages, it is 
most common in those older than the age of 50, and it is 
most often of vascular origin. In an unpublished study of 28 
Mayo Clinic patients, 13 had brain-stem or cerebellar infarc- 
tion, 8 had a tumor of the cerebellum or fourth ventricle, 5 
had head trauma producing posterior fossa damage, and 2 
had degenerative CNS disease, including cerebellar atrophy 
in 1. The site of the lesion varies, although the general region 
is the brain stem, specifically the dentate nucleus, inferior 
olive, superior cerebellar peduncle, red nucleus, and resti- 
form body. Additional neurologic signs of a brain-stem lesion 
can accompany palatopharyngolaryngeal myoclonus. Table 
5.12 summarizes the dysarthria of palatopharyngolaryngeal 
myoclonus. 


Paroxysmal Neurologic Voice Disorders 
Gilles de la Tourette’s Syndrome 


Georges Gilles de la Tourette (1885) first described the syn- 
drome that bears his name in a report of nine patients with 
multiple tics and involuntary vocalizations that included 
coprolalia (the uncontrolled utterance of socially unaccept- 
able language having to do with sexual and other bodily 
functions), and echolalia. Isolating the movements and 
sounds in this unusual disorder is difficult because they 
blend into appropriate cultural, environmental, and psycho- 
logic contexts involving speech and throat-clearing. A vari- 
ety of involuntary movements have been observed: jumping, 
squatting, skipping, hitting, kicking; repetitive movements, 
for example, touching the floor; hesitating to pick up objects 
as if they were hot; and touching chin, lips, tongue, throat, 
and other people. Other complicated movements include 
startle reactions, esophageal spasm, and echopraxia. 

Speech and nonspeech vocalizations include grunts, bark- 
ing, coughing, throat-clearing, echolalia, and coprolalia. 

In a study done by Shapiro, Shapiro, and Wayne (1973), of 
34 patients, 27 male and 7 female, the average age of onset 
was 7.4 years, ranging from 3 to 13 years. The age of onset is 
difficult to determine and is probably reported considerably 
later than the actual date. 

According to Cohen, Shaywitz, Caparulo, et al. (1978), a 
neurologic basis for Gilles de la Tourette’s syndrome is 
strongly supported by the high incidence of abnormal elec- 
troencephalographic and other neurologic findings and a 


Table 5.12 Laryngeal-Phonatory Characteristics in the 
Hyperkinetic Dysarthria of Palatopharyngolaryngeal 
Myoclonus* 


Findings 


Perceptual 

Phonation Momentary voice arrests during 
contextual speech if severe, but 
often undetectable under this 
condition. On vowel prolongation, 
momentary voice arrests occur 
rhythmically ranging from 60 to 
240 beats per minute (1 to 4 Hz). 
Note: Because often undetectable 
during contextual speech, vowel 
prolongation must be tested in all 
suspected cases. 

Normal 

Normal. Patient may have 
articulatory defects or flaccid, 
spastic, or ataxic dysarthria. 

Normal 


Resonation 
Articulation 


Language 

Physical 

Larynx Vocal folds adduct rhythmically and 
momentarily on vowel prolonga- 
tion, synchronously with voice 
arrests. Myoclonic movements of 
larynx and pharynx can be seen by 
observing the movements 
beneath skin of the neck. 

Soft palate elevates and falls, and 
lateral pharyngeal walls adduct 
and abduct synchronously with 
above structures 

Tongue Normal. May enter into myoclonic 

movements in synchrony with 

above structures 


Velopharynx 








Lips Norma 
Teeth Norma 
Hard palate Norma 
Mandible Nonspecific 
General Medical Nonspecific 
Other Neurologic Signs 

Peripheral nervous system Normal 


Central nervous system Other signs of brain-stem lesion 


may be present or absent 





Psychiatric]Psychologic Normal 


*To detect presence of this syndrome, observation of the oral musculature while 
patient is quiet and holding mouth open as steadily as possible is imperative; 
myoclonic movements are present at rest as well as during phonation. 





history of difficult birth. Although only 3 of 15 patients stud- 
ied by Golden (1977) had major personality problems, 11 of 
15 had difficulties with school work, 8 having learning dis- 
abilities. Only 4 of 15 had made a completely normal adjust- 
ment to school. 

A 17-year-old male wrote the following letter: 


I think I have that Tourette’s syndrome. It’s crazy. All 
these years I have been trying to cure it myself, trying to 
control all those nervous mannerisms that drove me up the 
wall. Like stuttering, my eye twitching, coughing, shaking, 
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whatever. I said, “How am I ever going to become a dancer 
or a lawyer if I can’t get up in front of a crowd?” You know 
what this syndrome does? It prevents people from being 
comfortable or being themselves so they just clam up and 
don’t take that risk of being uncomfortable. They begin to 
reject themselves for what they are then. It’s not like being 
fat or having crooked teeth, you can control those but this 
is something you have absolutely no control over .. . there 
is nowhere to run, it is inside you, with you, and you can’t 
leave it behind. 


Patients with Gilles de la Tourette’s syndrome respond 
well to haloperidol. Abuzzahab and Anderson (1974), in their 
review of drug treatment of the disease, found that 89% of 
patients improved and 67% maintained the improvement 
longer than 6 months, although there was no evidence that 
treatment alters the eventual outcome of the disease. Table 
5.13 is a summary of the dysarthria of Gilles de la Tourette’s 
syndrome. 


Neurologic Voice Disorders Due to Loss of Control 
over Volitional Phonation 


Apraxic Aphonia and Dysphonia 


Apraxia of phonation and respiration for speech along with 
apraxia of articulation results from lesions of Broca’s area in 


Table 5.13 Laryngeal-Phonatory Characteristics in the 
Hyperkinetic Dysarthria of Gilles de la Tourette’s Syndrome 


Findings* 

Perceptual 

Phonation Involuntary grunting, coughing, 
throat-clearing, barking, 
squealing, shrieking, 
screaming, gurgling, moaning 

Resonation Snorting, sniffing 

Articulation Whistling; clicking; lip-smacking; 
spitting; stuttering-like 
repetitions of sounds 

Language Echolalia, coprolalia 

Physical 

Larynx Vocal folds appear normal in 
structure and function 

Velopharynx Normal 

Tongue Normal 

Lips Normal 

Teeth Normal 

Hard palate Normal 

Mandible Normal 

General Medical Nonspecific 

Other Neurologic Signs 





Peripheral nervous system Normal 


Central nervous system 
Psychiatric/Psychologic 


Jerky bodily movements 


Emotional behavioral problems 
secondary to adverse social 
effects of above 


*Findings vary among patients; not all are found in a given patient. 


the dominant cerebral hemisphere. More than one type of 
phonatory manifestation of apraxia of phonation and respi- 
ration occur: (1) no articulatory movements or laryngeal 
sound, voiced or unvoiced; (2) articulation produced with an 
unphonated air stream (i.e., whispered speech); (3) articula- 
tory movements without accompanying exhalatory activity, 
patients being unable to inhale or exhale voluntarily on com- 
mand or in response to imitation, even though they are able 
to do so reflexly for vital purposes. They are unable to cough 
volitionally, showing instead confusion and trial-and-error 
efforts, yet they are able to cough automatically. When asked 
to cough, they will sometimes say the word cough rather 
then produce the cough itself. See Table 5.14 for a summary 
of laryngeal-phonatory signs in apraxia of speech. 


Table 5.14 Laryngeal-Phonatory Characteristics in Apraxia 
of Speech 


Findings 


Perceptual 
Phonation Varies, from normal in some patients 
to mutism in others. Phonation 

may be impossible because of 
apparent loss of recall for 
integration of respiratory and 
laryngeal movements, resulting in 
trial-and-error efforts to phonate, 
but silent nevertheless. Aphonic 
(whispered) speech can occur. 
Inability to cough volitionally 
or clear throat 
Resonation Normal 
Articulation Phoneme omissions, substitutions, 
reversals, and additions; 
stuttering-like blocking 
Language May be relatively normal or aphasic 
Physical 
Larynx Vocal folds appear normal in 
structure and function 
Velopharynx Normal 
Tongue Topographically normal. May have 
associated dysarthric signs. Trial- 
and-error nonspeech volitional 
movements (oral nonverbal 
apraxia) 
Lips Trial-and-error nonspeech volitional 
movements may be present. 
Sequential motor rates, the rapid 
sequencing of sounds from /pA/ 
to [tA] to /kA/, may be mildly to 
severely impaired. 
Teeth Norma 
Hard palate Norma 
Mandible Norma 


General Medical Nonspecific 


Other Neurologic Signs 

Peripheral nervous system Norma 

Central nervous system Awide variety of dysarthric, apraxic, 
and other abnormal signs may 
coexist. 





Psychiatric/Psychologic Nonspecific 
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Akinetic Mutism 


Deep and widespread cerebral and brain-stem lesions can 
cause (1) muteness and failure to respond to questions or 
comments; though patients are alert and have eye contact, 
they are indifferent to the examiner. These clinical behaviors 
have been noted in presenile and senile dementia 
(Alzheimer’s and Pick’s diseases), hydrocephalus, frontal 
lobe lesions and brain-stem trauma, and after seizures in 
children. 

The akinetically mute patient sits with eyes open, per- 
forms no or few voluntary movements, is unresponsive, 
looks away from the examiner, may have sucking move- 
ments, and is mute. The mutism is relative (i.e., mute 
except when prodded, in which case the patient will speak, 
usually normally, if no associated dysarthria, apraxia, or 
aphasia is present). A spectrum of severity must be taken 
into consideration; a given patient may be mute all the 
time early in the recovery period, may be mute only some 
of the time later on, and may show only vestiges of mutism 
during advanced stages of recovery by nothing more 
unusual than prolonged latency of speech response. Aki- 
netic mutism can result from anoxia, metabolic diseases, 
subdural hematomas with compressions, cerebrovascular 
accident, or tumors, and may be seen in advanced parkin- 
sonism and after thalamotomy. 

The original description by Cairns, Oldfield, Pennybacker, 
et al. (1941) of the akinetically mute patient follows: 


The patient sleeps more than normally, but is easily roused. 
In the fully developed state he makes no sound and lies 
inert, except that the eyes regard the observer steadily, or 
follow the movement of objects, and they may be diverted 
by sound. Despite a steady gaze, which seems to give prom- 
ise to speech, the patient is quite mute, or answers in whis- 
pered monosyllables . .. commands may be performed in a 
feeble, slow and incomplete manner, but usually there are 
no movements of a voluntary character, no restless 
movements, struggling, or evidence of negativism. Emo- 
tional movement also is almost in abeyance. A painful 
stimulus produces reflex withdrawal of the limb, and if the 
stimulation is sustained, slow, feeble voluntary movement 
of the limbs may occur in an attempt to remove the source 
of stimulation, but usually without tears, noise, or other 
manifestations of pain or displeasure. The patient swal- 
lows readily, but has to be fed . . . fluctuations may occur in 
the intensity of this state. In its incomplete manifestations, 
the patient may respond at times, though slowly and 
imperfectly by speech and voluntary movement . . . incon- 
tinence persists and there is little or no trace of sponta- 
neous activity or speech. 


Klee (1961), in his review of the literature, summarized 
the varied clinical behaviors of many patients with akinetic 
mutism: 


Patients remained in akinetic and mute states for periods 
ranging from weeks to months, then gradually began to 
follow movements in their vicinity with their eyes but 
without interest ... consciousness clear, no disturbances of 
orientation, memory or powers of observation; always pos- 
sible to make contact although required much patience... 


able to answer questions adequately but in monosyllables 
and apparently with difficulty . . . trance-like states .. . 
episodes during which the patient would, in the midst of 
household activities, suddenly become inert and unrespon- 
sive and as suddenly, about ten minutes later would 
resume where she had left off. 


Daly and Love (1958) described a 14-year-old boy who had 
an astrocytoma of the upper part of the fourth ventricle, 
extending around the lower end of the aqueduct, with the 
ventricular system cephalad to the tumor being greatly 
dilated. After the operation, he developed a condition in 
which speech and movements were inhibited, although he 
could understand what happened around him and what was 
said to him. He began to speak 34 days after the operation, at 
first using only monosyllabic words and short sentences, and 
movements became gradually less inhibited during the lat- 
ter part of a 2-month observation. Klee (1961) expands on 
the point that the degree of inhibition changes from one 
minute to the next and that the duration of inhibition can be 
minutes to months: 


The patient may be almost totally immobile and mute or 
may have only a suggestion of hampering of speech and 
movement. In the most inhibited states, the appearance 
and movements of the eyes may be the only indication that 
the patient is conscious. If he speaks, it is often only after a 
long latent period, and then with monosyllabic words or 
short sentences, in a monotonous voice. Movements are 
slow, and they show a similar long latent period; sponta- 
neous activity may be less hampered than the voluntary. 
Variations in the degree of inhibition may occur from one 
minute to the next, and the inhibition may last for minutes 
or months. The patient’s experience of the inhibition of 
speech seems to be that they know what they wish to say 
but that the words cannot be said aloud. They do not 
appear to have a clear idea of how inhibited they actually 
are. 


A summary of the lesions that have caused akinetic 
mutism follows: tumors of the third ventricle; hemangioma 
of the mesencephalon and in and around the third ventricle; 
basilar artery thrombosis; Wernicke’s encephalopathy; 
encephalitis; and bullet wounds through the frontal lobes. 


Frontal Lobe Dysphonia, Aphonia, and Mutism 


Severe breathiness, aphonia, and mutism are common in 
patients who have had frontal lobe lesions, most commonly 
from traumatic head injuries, tumor, or vascular disease. It 
has been speculated that such disturbances in phonation are 
associated with affective or personality changes from dam- 
age to the limbic system and its cortical and subcortical con- 
nections, which are thought to be important in regulating 
affect and emotion and their vocal expression. These 
patients demonstrate generally reduced drive and apathy. In 
their review of a large number of studies of the effects of 
frontal lobe lesions on phonation, Sapir and Aronson (1985) 
concluded: 


These reports argue that damage or disturbance to the lim- 
bic system and its neocortical and subcortical connections 
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has profound effect on emotions and affect, vocal-facial 
expressions, and the conscious monitoring of these expres- 
sions. Specifically, such neural disturbances are said to be 
responsible for apathy, a “flat” affect and lack of drive, 
aphonic or hypophonic vocalization . . . and poor insight 
into the inappropriateness of one’s vocal and nonvocal 
behaviors. 


Nevertheless, there always exists the potential for a patient 
who has a neurologic disease to develop a psychogenic voice 
disorder that can be misinterpreted as neurologic by associa- 
tion with the neurologic disorder. In four patients who had 
central and peripheral nervous system disease described by 
Sapir and Aronson (1987), their dysphonias turned out to be 
psychogenic because of time-related psychosocial problems. 
These experiences taught the authors the importance of not 
automatically assuming that all dysphonias associated with 
neurologic disease are neurologic, and that the clinician 
always needs to be sensitive to the possibility of a dysphonia 
triggered by the anxiety or depression over the neurologic 
disease itself or by personal and family problems generated 
by illness and hospitalization. 


Foreign Accent Syndrome 


Foreign accent syndrome is an acquired speech disorder 
perceived by others as if the person is speaking his or her 
native language with a foreign accent or dialect. This impres- 
sion is caused by segmental and suprasegmental deviations 
in pronunciation; these foreign accent syndrome patients 
develop their problem after either cerebral vascular accident 
or other CNS lesion. The syndrome can occur in association 
with dysarthria, apraxia of speech, or aphasia. The main 
speech changes are suprasegmental pitch, stress, and 
rhythm patterns. Phonetic and phonemic aberrations can be 
found in their speech; shortening of vowels, insertion of 
schwa vowels, and dentalization of lingua-alveolar conso- 
nants. Not all listeners agree on which foreign language the 
speaker’s accent represents. Most cases are associated with 
left precentral gyrus lesions. 


The Case of Astrid L. 


On September 6, 1941, during an air raid over Oslo, Norway, a 
30-year-old woman named Astrid L. was struck in the head 
by a shell fragment that took away her left frontal bone, 
exposing extruded, lacerated brain. Upon recovery of full flu- 
ency, she sounded as if she were speaking her native Norwe- 
gian with a German accent. This strange development 
proved embarrassing for her, as her country was at war with 
Germany. She complained bitterly about being taken for a 
German in shops, where clerks refused to sell her merchan- 
dise. Yet she had never left Norway and had had no contact 
with the German language or with German-speaking people 
at any time in her life. 

That a person could, as result of a head injury, seem to be 
speaking in a foreign accent is difficult for serious students of 
communicative disorders to accept. Dialects traditionally are 
considered learned behavior. The pronunciation of consonants 
and vowels, the stress patterns placed on syllables of words, 


Table 5.15 Pseudo-Foreign Dialect 


Number of Cases Reported, 1907-1978 


Mayo Clinic 13 
Other 12 
Total 25 
Number Percentage (%) Age (Years) 
Females 16 64 27-59 
Males 9 36 29-71 


and the pitch-inflectional patterns superimposed on syntax 
follow logical laws of learning. Astrid was decidedly an enigma 
to her neurologist, Monrad-Krohn (1947), who reported that 
she had undergone a complete change in the melody of her 
language. She overemphasized final pronouns, and her pitch 
would rise on such pronouns at places where they should fall. 
She failed to blend the final sounds of words with the initial 
sounds of succeeding words. Dysprosody was the term he used 
to describe aberrations of normal pitch stress patterns such as 
this, analogous to what American phoneticians refer to as into- 
nation and stress, what the French call chanson de parler, and 
what the Germans refer to as sprach-melodie. 

Before Monrad-Krohn’s report, in 1907 Pierre Marie had 
described a Frenchman who began to speak in an Alsatian 
dialect after a cerebrovascular accident. In 1919, Arnold Pick 
reported a 29-year-old Czechoslovakian who began to speak 
in a Polish dialect, also after a cerebrovascular accident, right 
hemiparesis, and aphasia. A search of the literature uncov- 
ered 25 such cases noted between 1907 and 1978, 13 from 
Mayo Clinic files; 12 from other sources (Table 5.15). Sixteen 
(65%) were females aged 27 to 59 years, and 9 (36%) were 
males aged 29 to 71 years. One patient had shifted from 
French to Alsatian, one from Czech to Polish, one from Nor- 
wegian to German, three from British English to Welsh, and 
one from British English to French. From American English, 
four switched to German, two to Swedish, two to Norwegian, 
one to Spanish, two to “New England,” one to Welsh, Scot- 
tish, or Irish, and three to Italian. Three were difficult to clas- 
sify. It is interesting that 10 patients (40% of the group) had a 
shift to dialects that could be described etymologically as 
Germanic (German, Swedish, or Norwegian). 

The onset of foreign dialect was associated with neuro- 
logic disease. Fourteen (56%) developed a dialect after a 
cerebrovascular accident; 6 (24%) after head trauma; and 5 
(20%) had histories of seizures after cerebrovascular accident 
or brain trauma. An effort was made to determine the num- 
ber of patients whose foreign dialect was embedded in or 
followed more classic dysarthria, apraxia, or aphasia. Seven- 
teen (68%) had one or more of these neurologic speech lan- 
guage signs in addition to the foreign dialect. In 8 (32%) it 
was impossible to determine the presence of these associ- 
ated signs. These unusual cases raise several questions: 


Are these so-called dialectal changes truly dialectal or do 
they only sound as if they are because the direction of 
defect is coincidentally the same as one finds in the 
learned behavior of a true dialect? 
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+ Is the dialect a primary component of the speech disorder 
or is it a compensatory reactive change to a more primary 
one? 


@ What specifically is happening at the phonemic, stress, 
and pitch-inflectional levels that gives the impression of 
foreign dialect? 


+ Ofthe thousands of patients who develop speech changes 
due to cerebral disease, why have so few demonstrated 
this unusual dialectal change? 


è Are the pitch changes that partially characterize the 
dialect a signal that those regions of the brain that sub- 
tend musical production have been impaired? 


è What is the connection between the onset of foreign 
dialect and the high incidence of its occurrence during 
the recovery stages of dysarthria, apraxia, and aphasia? 


If one were to submit the following speech characteristics 
to a specialist in neurologic communicative disorders with- 
out informing the clinician of the patient’s history or clinical 
studies, into what classification of speech or language disor- 
der would these signs best fit? 


+ Distortions and prolongation of vowels and semivowels 


+ Consonants produced with slightly off-target tongue and 
lip placements (1.e., the sound is an allophonic deviation) 


Substitution of one phoneme for another 


+ Blocking on initial consonants 


+ 


Equal stress placed on all syllables in words and sen- 
tences 


Prolongation of silent intervals between words 
Insertion of vowel sounds between words 


Inappropriate pitch patterns 


¢- >% è% o 


All the above produced inconsistently 


These composite features of patients with “foreign dialect” 
most closely resemble the patient who has apraxia of 
speech; they are characteristic of neither dysarthria nor 
aphasia. That is to say, they are similar to apraxia in a general 
sense. Obviously, there is something special about the “for- 
eign dialect” patient's specific errors that distinguishes that 
person from the typical apraxic. Pitch inflectional patterns 
may be the major factors responsible. 

A search through the histories of such patients reveals a 
common denominator: the presence of severe dysarthria 
and apraxia of speech. Although some are also aphasic, in 
several no aphasia was present. 

A 27-year-old female who had sudden onset of numbness in 
her right hand, right facial weakness, mild dysarthria, and no 
previous history of foreign language exposure eventually devel- 
oped an apraxia of speech. As the hours passed, her rate of 
spontaneous speech again became normal. However, she spoke 
with a foreign accent that several independent witnesses iden- 
tified as German. The “r” sound was susceptible to mispronun- 
ciation and she hesitated slightly in an apparent search for 


either the right word or articulatory position, yet her speech 
was free, appropriate, and completely intelligible. There was no 
dysphasia, and when examined 1 month later and 7 weeks 
after onset, her speech had returned to normal in all respects. 
Her foreign accent had disappeared (Whitty, 1964). 

A 31-year-old female, after awakening and relieving her- 
self, became dizzy, could not get words out, and by the end of 
the day was able to say only one-syllable words, although 
she never had difficulty thinking of the right words. By the 
end of 3 weeks, she was talking in complete sentences, at 
first in a flat intonational pattern. As inflectional changes 
returned, they took on a Spanish-sounding accent. Another, a 
46-year-old female with cerebral ischemia following clamp- 
ing of the left common carotid artery and left cervical sym- 
pathectomy for left internal carotid artery aneurysm, had a 
“New England” dialect for the first time in her life and an oral 
verbal apraxia in which she groped for tongue positions and 
initiated words incorrectly. She was also mildly aphasic. 

Of 13 Mayo Clinic histories, 8 (62%) had apraxia of speech 
as an antecedent to the onset of the “foreign dialect.” 

A 41-year-old woman developed a Germanic-sounding 
dialect after a gunshot wound in the right occipital and pari- 
etal bone. The patient had spent 3 years in Germany, where 
she had learned German while married to an Air Force pilot. 
She had a left hemiparesis, left homonymous hemianopsia, 
spasticity of the right leg, apraxia of the left leg, an ataxic 
gait, and an abnormal EEG indicating a focal abnormality 
with epileptogenic components, from the right posterior 
temporal regions predominately. The evolution of her dialect 
began with complete absence of speech and a period of 
dysarthria and apraxia of speech that gradually disappeared, 
leaving the “foreign dialect” in its wake. 

Broca’s area is most frequently implicated in cases of sud- 
den onset of foreign dialect. Very few dispute this localization 
except Cole (1971), who wrote about a patient who devel- 
oped an Eastern European dialect after a brain-stem lesion. 


Summary 


è Neurologic voice disorders are components of dysarthria 
and are classified into the same subtypes as the 
dysarthrias, based on the location of the lesion in the 
nervous system and muscular pathophysiology. 


+ Most neurologic voice disorders are acoustically distinc- 
tive, which contributes to their differential diagnosis. 


+ Neonates and infants with neurologic disease cry abnor- 
mally. 


+ Vocal fold paralyses due to tenth cranial nerve (vagus) 
lesions and their effects on voice depend upon the loca- 
tion of the lesion along the nerve pathway from brain 
stem to muscle. 


Ever present is the danger of misdiagnosis of neurologic 
dysphonia as being psychogenic, especially those 
resulting from mild neurologic diseases of the tenth cra- 
nial nerve and from early bulbar myasthenia gravis. 
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+ Flaccid dysphonias caused by tenth cranial nerve or 
myoneural junction disease are generally breathy and 
reduced in loudness due to vocal fold hypoadduction. 


Spastic (pseudobulbar) dysphonia generally has a harsh, 
strained-strangled quality due to vocal fold hyperadduc- 
tion. 


+ Hypokinetic (parkinsonian) dysphonia is generally char- 
acterized by monopitch and reduced loudness resulting 
from the rigidity of muscles. 


+ Ataxic dysphonia, when present in ataxic dysarthria, may 
be tremorous or may show sudden bursts of loudness due 
to reduced muscular feedback. 


+ Choreic dysphonia consists of sudden alterations in pitch, 
loudness, and quality and, often, voice arrests, resulting 
from uncontrolled jerking muscular movements. 


¢ Dystonic dysphonia consists of slower alterations in pitch, 
loudness, and quality and, often, voice arrests, resulting 
from slower, uncontrolled muscular movements. 


+ Organic voice tremor can exist in isolation or as one of 
several tremors distributed throughout the body. The 
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Chapter 6 


Spasmodic Dysphonia 


When we approach the problem of spastic dysphonia we 
see so many different opinions about any of its many 
aspects that we come to the conclusion that one of the fol- 
lowing two things has happened: either we are not speak- 
ing about the same thing or we look at the same thing with 
different eyes. And these “eyes” depend upon our culture 
and on the understanding that even scientific “truths” are, 
frequently, provisional. 


—Bloch 


In this chapter, we discuss a voice disorder that has a history 
of being the most serious, controversial, and challenging 
regarding its etiology and treatment of all voice disorders 
in our field. Once upon a time it used to be called spastic 
dysphonia. Contemporary clinicians and researchers have 
settled upon spasmodic dysphonia, originally recom- 
mended by Aronson et al. (1968a,b) because of the confus- 
ing implications of the word spastic (confusable with the 
spasticity of pseudobulbar dysphonia, which can produces 
similar voice signs and symptoms). Throughout its history, 
spasmodic dysphonia has had many synonyms: spastic 
dysphonia, lalophobia, phonatory glottic spasm, mogipho- 
nia, psychophonastachesia, and stuttering at the glottal 
level. 

The term spasmodic dysphonia is defined as voice breaks 
or stoppage stemming from cessation of vocal fold oscilla- 
tion generally from hyperadduction of the true and/or false 
(ventricular) folds during attempted phonation, producing a 
voice that is strangled, hoarse, and effortful, which is often 
continuous or sustained, intermittent or choppy. In some 
patients, the vocal fold adductor spasms can be so intense 
that a patient shows engorgement of the veins of the neck, 
flushing of the face, or at times cyanosis accompanied by 
grunting, groaning, and extreme fatigue. The voice is so 
bizarre that it is profoundly incapacitating, disturbing to 
patient and listener, having profound psychiatric, social, and 
vocational repercussions severely compromising one’s qual- 
ity of life. 


Historically descriptions of spasmodic dysphonia have a 
long history. In 1875 Jonann Schnitzler, a prominent laryn- 
gologist in Vienna, first described a condition he called 
"spastic dysphonia," where symptoms ranged from severe 
cases of vocal cord cramping preventing emission of sound, 
to milder cases where the voice is forced each time phona- 
tion is attempted. Subsequently, other clinicians described it 
and made astute observations about its characteristics, range 
in signs and symptoms, and possible influencing variables. A 
selected sample of these early observations include: a spas- 
modic activity of the tensors of the vocal cords that provoke 
a feeble, intermittent, jerky voice (Mackenzie, 1880); exces- 
sive innervation that goes beyond the voluntary occlusions 
(Stern, 1928); a rare illness, most of the time undiagnosed 
and difficult to treat, that attacks subjects who do much 
speaking (Nadoleczny, 1938); a disordered phonation char- 
acterized by vocal force, grunting, and inhibition; whispered 
voice in severe cases going to total aphonia with maximal 
tension. The disorder is manifested above all during commu- 
nicative speech, more rarely in speaking without listeners 
(Berendes, 1938). A disorder of speech, grunting, can be 
emitted in a forced fashion with vowels often separated into 
two accentuations. The disorder is rare. Because its severiety 
is often dependent upon the situation or emotional status of 
a patient, numerous authors classify it among the neuroses 
(Luchsinger, 1949); not as a clinical entity, but as being 
symptomatic of a larger neurologic syndrome (Robe, Brum- 
lik, Moore 1960). From the late 1800s through the middle of 
the 20th century, clinicians were mixed about the etiology of 
this disorder. 


+ Terminology and Definitions 


Spasmodic dysphonia is an example of the way in which the 
meanings of words change with time and the expansion of 
knowledge and of the confusion that accompanies such 
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change. Early in the history of this disorder, those few clini- 
cians who wrote about it seemed to be in reasonable agree- 
ment that spasmodic dysphonia was intended to mean a 
single entity, distinct from other voice disorders, which con- 
formed to fairly narrow criteria. The original definition of 
adductor spastic dysphonia, reviewed by Arnold (1959), is 
paraphrased as follows: 


+ A voice that is variably squeezed, strained, choked, stac- 
cato, stuttering-like, jerky, grunting, groaning, effortful, 
pinched, grating, and has periodic breaks in phonation. It 
has a tendency to be monopitched and reduced in loud- 
ness, and vowels are initiated with hard glottal attacks. 


+ The abnormal voice occurs only during voluntary phona- 
tion for communication purposes and not during singing, 
vowel prolongation, laughing, or crying. 


+ The abnormal voice is the effect of hyperadduction of the 
true and false vocal folds. 


+ The disorder is caused by psychoneurosis from either 
occupational stress or emotional trauma, such as family 
conflicts, accidents, terrifying events, or accumulated 
frustrations. 


What needs to be remembered is that this definition orig- 
inated with a few otolaryngologists and psychiatrists, whose 
experience with patients who had this kind of strained voice 
was limited and whose knowledge of both the psychiatric 
and neurologic characteristics of their patients paralleled the 
relatively undeveloped state of both specialties. The first 
article written on the subject by Traube in 1871, setting into 
motion and directing subsequent thought on this disorder, 
described it as “a spastic form of nervous hoarseness.” At 
least a half dozen other writers before 1900 also linked the 
disorder to psychoneurosis (Kiml, 1963). Their early impres- 
sions about psychogenic etiology, given at the dawn of mod- 
ern psychiatry and psychology, have ultimately proved to be 
correct for some of the patients exhibiting this disorder. 
However, subsequent experience has shown—particularly 
during the past 20 years—that what these clinicians had 
failed to notice, understand, and document during the early 
years was that there were people whose affected voices 
sounded very much like the spastic dysphonia as defined 
originally but were caused by neurologic, not psychogenic, 
disorders. This, too, is understandable because the specialty 
of neurology at the turn of the century was in a similar state 
of primitive evolution, as were psychiatry and psychology. 

With the passage of time, more clinicians began to witness 
cases of strained hoarseness of different types, many of 
which were eventually suspected of being of neurologic eti- 
ology, and still others for whom neither psychoneurosis nor 
neurologic substrates could be demonstrated. Although 
these patients failed to conform to the early criteria for spas- 
tic dysphonia, their disorders were called spastic dysphonia 
nevertheless, mainly because that was the way they 
sounded, and diagnosis is made by auditory perception. 

Terminology being vitally important to clarity, we are 
faced today with the following dilemma: either there is only 
one true spasmodic dysphonia, the specific criteria for which 
were set down by the early writers on the subject and just 


listed in this chapter, or all other strained voices are not true 
spastic dysphonia and should either be called something 
else, or the term should be modified to specify etiology. We 
no longer use the term spastic dysphonia. It has been 
replaced by spasmodic dysphonia advocated by Aronson 
(1968a) when he wrote 


We would like to advocate a change in the term used to 
describe the voice disorder we have been calling spastic 
dysphonia. In the neurologic world the term spastic is 
commonly used to imply disease of corticobulbar or corti- 
cospinal (pyramidal) pathways. .. . It is also our observa- 
tion that the strained voice quality in this disorder waxes 
and wanes from moment to moment in a spasmodic 
fashion.’ For these reasons, therefore, we propose the term 
spasmodic dysphonia to prevent the confusion that arises 
from the use of the term spastic (p. 200). 


The viewpoint taken in this book is that the latter course is 
preferable to the former because it recognizes the ever- 
widening recognition of the multiple origins of adductor 
laryngospasm. Therefore, the following definition and crite- 
ria pertaining to the voice disorder concept of spastic 
dysphonia are recommended, based on evidence to be 
presented later in this chapter: 


+ The term adductor should be used as a prefix to designate 
a class of voice characteristics produced by hyperadduc- 
tion of the vocal folds. 


¢ Continuing along this line of thinking, then, adductor 
spasmodic dysphonia is a perceptual term that refers to a 
strained, groaning, staccato, effortful voice, no different 
from the original definition. 


+ It varies within and among patients according to severity 
and detailed voice characteristics. 


+ Though generally neurologic, it may be of psychogenic, 
neurologic, or of unknown (idiopathic) etiology. 


If data lead the clinician to conclude that the laryn- 
gospasms are psychogenic beyond a reasonable doubt, 
then the term used should be psychogenic adductor spas- 
modic dysphonia. 


If data lead the clinician to conclude that the laryn- 
gospasms are neurologic beyond a reasonable doubt, then 
the term used should be neurologic adductor spasmodic 
dysphonia. 


+ If data fail to prove either psychogenic or neurologic etiol- 
ogy, then the term idiopathic adductor spasmodic dyspho- 
nia should be used. 


In summary, a modern definition of the term adductor 
spasmodic dysphonia is that it applies to a family of strained 
voices produced by adductor laryngospasm that arise from 
different etiologies. These laryngospasms are abnormal 
involuntary movements that are action-induced by speak- 
ing for specific tasks as the larynx is generally normal at rest 
(Brin, Fahn, Blitzer, et al., 1992). To distinguish among them 
is not a trivial academic exercise. It has considerable impor- 
tance to clinicians responsible for establishing a differential 
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diagnosis of laryngospasms in patients who along with their 
physicians need to know the truth about their abnormal 
voices and whose treatment may depend upon the type of 
adductor spasmodic dysphonia that they have. 


¢ Adductor Spasmodic Dysphonia 


Pathophysiology 


The common denominator underlying all types of adductor 
spasmodic dysphonia is adductor laryngospasm of the vocal 
folds. While keeping in mind that within and among patients 
the degree and frequency of adductor laryngospasms vary 
considerably, the following kinds of laryngeal adductor, and 
sometimes even sphincter, actions have been observed dur- 
ing videofluoroscopy and videofiberoptic laryngoscopy 
(McCall, Skolnick, and Brewer, 1971; Parnes, Lavarato, and 
Myers, 1978): 


+ Adductor spasms of the true vocal folds only Hyperadduc- 
tion of the true vocal folds alone occurs in mild cases. 
Although the degree of adductor force cannot be seen 
readily during such spasms, the midline separation 





Figure 6.1 Frames from anteroposterior and lateral cinera- 
diography of vocal folds during onset of laryngospasm in 
spastic dysphonia. (A) Glottis is open. (B, C) True and false 
vocal folds, respectively, relatively demarcated by laryngeal 
ventricle in between. (D) Hypopharynx is open, the larynx still 
relatively low in the neck. 


between the ligamentous portions of the vocal folds may 
be obscured owing to compression of one vocal fold 
against the other. The vocal folds can be seen to snap shut 
synchronously with each voice arrest. 


+ Adductor spasms of both the true and false vocal folds 


During moderate to severely strained voice or voice 
arrest, the false or ventricular vocal folds close over and 
obscure the true folds. 


+ Supraglottic constriction Severely strained voice or voice 


arrests are accompanied by constriction of the hypophar- 
ynx (i.e., the inferior pharyngeal constrictor just above 
the level of the false vocal folds). The shape of the lumen 
during such closure is not an anteroposterior, elongated 
one, as in true and false vocal fold hyperadduction alone, 
but is sphincteric or circular. Moreover, the true and false 
folds hyperadduct along with the pharyngeal constric- 
tors. What appears to be true, then, is that the more 
severe the adductor laryngospasms, the greater the verti- 
cal extent of intrinsic laryngeal muscle participation. As 
tightness of spasms increases, constriction progresses 
superiorly from the true folds to involve the ventricular 
folds, and, ultimately, the pharyngeal constrictors (Figs. 
6.1, 6.2). The acoustic consequences of these constrictions 
are displayed in a sonogram in Figure 6.3. 


Figure6.2 Frames from anteroposterior and lateral cineradi- 
ography of vocal folds at height of laryngospasm in spastic 
dysphonia. (A) Glottis is closed. (B, C) True and false vocal 
folds, respectively, massively adducted and poorly demar- 
cated. Laryngeal ventricle is obliterated. Vocal fold position is 
elevated. (D) Hypopharynx obliterated owing to sphincteric 
closure and elevation of larynx. 
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Figure 6.3 Sonogram of a prolonged vowel produced with normal 
voice (A) and a spasmodic dysphonic voice (B). Vertical striations (each 
one representating a glottal pulse) in A are equally spaced and regular, 
whereas in B they are irregular. In A, dark bands of energy and formats 
run across the length of the sonogram, but are interrupted and difficult 
to ascertain in the vowel prolongation in the lower figure because of 
voice breaks. 


Although hypercontraction of the intrinsic laryngeal and 
pharyngeal muscles is fundamental to the adductor spas- 
modic dysphonias, moderate to severe laryngospasms are 
accompanied by synchronous movements of the entire lar- 
ynx, most often in a superior (cephalad) direction, as shown 
in the succeeding figures. Associated laryngeal movements 
implicate the extrinsic laryngeal muscles as well. Such 
movement can also be seen through the skin of the neck. 
Downward and forward displacement of the larynx occurs 
less often. As expected, increased amplitude of electromyo- 
graphic potentials in the general neck region have been 
recorded during such spasms (Tarrasch, 1946). It is impor- 
tant to realize, then, that the adductor spasmodic dyspho- 
nias are disorders not just of the true and false vocal folds but 
also of the supraglottic pharyngeal constrictors and the strap 
muscles, moving the body of the larynx. The neuroanatomic 
implications of these facts is that not only the tenth cranial 
nerve, innervating the intrinsic laryngeal muscles, but also 
the ninth cranial nerve to the pharynx and the cervical spinal 
nerves to the extrinsic laryngeal muscles may transmit nerve 
impulses responsible for the spasms. However, this observa- 
tion is not to be misinterpreted as suggesting the problem is 
peripheral, because current theory suggests the muscle and 
nerve abnormalities reflect central remodeling rather than a 
peripheral disorder. 

Aberrant respiratory movements often accompany the 
laryngospasms. The clinician can see sudden, jerky arrests 
and other dysrhythmic movements of the thorax and 
abdomen synchronous with strained voice or voice arrests. 
Respiratory dysfunction is most probably the secondary 
effect of the uncontrolled glottic closures (i.e., during adduc- 


tor spasm, increased exhalatory effort is required to force air 
through the constricted glottis). If an adductor spasm is par- 
ticularly abrupt, the exhalatory muscles react to the sudden 
laryngeal arrest of exhaled air by contracting more quickly 
and forcefully, actions that can be seen with the naked eye. 
The speaker, in effect, is caught in the throes of a series of 
involuntary effort closures or Valsalva maneuvers. The nor- 
mal speaker can simulate adductor spasmodic dysphonia 
realistically by trying to phonate while straining or bearing 
down. Notice how exhalatory arrests are accompanied by 
sudden, visible abdominal and thoracic movements. This 
might lead clinicians to the incorrect conclusion that the res- 
piratory behaviors are all secondary. This notion is dispelled 
by presentation of six patients reported by Brin (1992) who 
all exhibited respiratory adductor abnormal involuntary 
adduction of the vocal folds on respiration but were normal 
during phonation. Whether voice or respiratory in origin, 
severe laryngospasm produces other visible features, such as 
flushing of the face. Silent lip movements may occur during 
moments of voice arrest. Even stuttering-like articulatory 
repetitions during inaudible voice are common as the patient 
struggles, by repetition, to compensate for moments of unin- 
telligibility. Other signs of struggle are excessive contractions 
of the neck muscles, shoulder girdle, and upper arms. Typi- 
cally, patients with a severe, long-standing dysphonia will 
present with a strained facial expression consisting of frown- 
ing, squinting, and downward turning of the angles of the 
mouth. What is unseen by the observer is the intense physi- 
cal effort and fatigue felt by the patient. So tiring is speaking 
that many patients elect to whisper because of the freer air- 
flow that it provides or become recluses and avoid talking 
whenever possible. It should come as no surprise then that 
therapeutic alleviation of laryngospasm is as much a physical 
relief as it is a contribution to voice improvement. 


Incidence 


Although the incidence and prevalence of adductor spas- 
modic dysphonia in the general population is unknown, the 
National Spasmodic Dysphonia Association estimates there 
are between 35,000 and 50,000 Americans afflicted with 
this disorder. Additionally, limited data on frequency of 
occurrence, age, and sex in individual practices are avail- 
able. The reader must bear in mind two kinds of errors in 
reporting: (1) Most past and even current reports do not 
identify the patients studied according to etiologic type. 
Patients are chosen almost exclusively on the basis of the 
sound of their voices, and the clinicians making judgments 
have varying knowledge and experience with the disorder. 
Quite likely, then, these are not homogeneous populations. 
More probably, they represent mixtures of psychogenic, 
neurologic, and idiopathic types. (2) In collecting statistical 
data in clinical practice, some clinicians fail to recognize 
the disorder when confronted by it, and others identify it 
erroneously when it does not exist. These sources of error 
must be considered, particularly the first, when critically 
reading studies reported here and elsewhere. Problems 
encountered in critically reading existing literature to 
determine the incidence, age of onset, or etiology are not 
limited to the clinical sources of error. An evidence-based 
literature search on all articles published on spasmodic 


6 Spasmodic Dysphonia 105 





dysphonia medical interventions between 1966 and 2000 
found that whereas there were 1324 subjects included in 
the 24 articles meeting search criteria, the actual number of 
different subjects was estimated at only 395. Consequently, 
readers also need to recognize that a single center publish- 
ing several articles on the same, or nearly same, set of sub- 
jects can skew general impressions of a population toward 
whatever unique characteristics their population might 
appear to possess such as age of onset, concomitant health 
problems, or specific etiology. The following data and those 
later in this chapter are given with awareness of these 
limitations. 

The consensus is that adductor spasmodic dysphonia is 
rare. Most clinicians who do not work in specialty clinics 
witness only a few cases over a lifetime, and some have 
never had any direct experience with such patients. 


Age of Onset 


Most research publications cite middle age as the time of 
onset. Aronson (1968a) reported distribution of age of onset in 
spastic dysphonia in 34 patients. The average age was 44 years 
(range, 28 to 69 years): 20 to 29 years, 9%; 30 to 39 years, 24%; 
40 to 49 years, 29%; 50 to 59 years, 20%; and 60 to 69 years, 
18%. Brodnitz (1976) reported a mean age of 50.2 years in 130 
patients with the following distribution: 20 to 30 years, 10%; 
31 to 40 years, 16%; 41 to 51 years, 27%; 51 to 60 years, 29%; 
older than 60 years, 18%. His youngest patient was 27 years, 
his oldest 76 years. In another study of 100 patients. Aronson 
(1979) found a median age of onset of 50 years. 

The difficulty getting a handle on the incidence is under- 
scored by an evidence-based literature search on medical 
interventions reported between 1966 and 2000 that 
although 1324 total subjects were reported in the 24 articles 
reviewed, the number probably represented a total number 
of 395 different subjects (Duffy and Yorkson, 2003). 


Sex Ratio 


The male to female ratios reported in different studies range 
from ~1:1 to 1:4. Because patients usually present them- 
selves for examination, they introduce a self-selection factor 
that may not be a true indication of incidence in the general 
population. 


Onset and Course 


With rare exception, adductor spasmodic dysphonia begins 
insidiously as a nonspecific hoarseness, at first fluctuating 
in severity, with intervening periods of normal voice. Then, 
gradually, the strained adductor laryngospasms intrude into 
the hoarseness. The disorder may plateau or continue to 
worsen until phonation during speech is all but impossible. 
From a follow-up study of 100 patients, Aronson (1979) 
found that the median time necessary for spastic dysphonia 
to develop into its full-blown state from onset was approxi- 
mately 1 year in both males and females. Although some 
patients consulted a specialist almost immediately after 
onset, others waited as long as 8 years. The median period 
from onset until consultation was 2 months. These patients 
consulted an average of four specialists, although some had 
gone to as many as 25 in search of help. Izdebski, Dedo, and 


Boles (1984), in a survey of 200 patients, found gradual 
onset in 84% of patients; the remainder reported sudden 
onset. They found that in patients whose disorder had 
begun gradually, the voice declined and reached a plateau 
within 6 to 9 months. Patients whose voices intermittently 
fluctuated between abnormal and normal from onset dete- 
riorated and reached a plateau after ~23 months. Once the 
disorder takes hold, remissions are rare unless the spastic 
dysphonia is of the conversion reaction type. In an 18-year 
follow-up study of 10 spastic dysphonic patients, Boren- 
stein, Lipton, and Rupick (1978) found that in all 10 patients, 
the disorder had remained; four reported no change during 
the intervening years, two reported that their voices had 
worsened slightly, three had mildly improved, and one 
improved substantially. 


Factors Affecting Severity 


During the life of the disorder, patients report unexpectedly 
wide fluctuations in severity. Periods of emotional stress 
make the disorder worse. Talking on the telephone and to 
authority figures is especially difficult, and the voice is vul- 
nerable to moods of anxiety and depression. Physical labor 
also can cause the voice to worsen. One patient reported that 
if he knew he needed an especially good voice for some occa- 
sion, he would refrain from lifting, pushing, or digging. The 
temptation is to conclude from such testimony that volun- 
tary effort closure increases susceptibility to spasm. Yet, at 
other times, the spasms can be mild, prompting many 
patients to claim periods of nearly normal voice. These 
moments of relief are usually accompanied by feelings of 
relaxation and freedom from anxiety. For that reason, the 
voice is often much better while the patient is on vacation. 
One patient reported that while on a trip to Hawaii, her voice 
began to improve incrementally in proportion to the dis- 
tance from the California coast, so that by the time she had 
reached the islands her voice was the best it had ever been. 
On her trip home, the closer her plane got to the mainland, 
the worse her voice became. By the time the plane had 
landed, her dysphonia was as bad as when she had left. 
These patients have a baffling ability to produce normal 
voice when surprised or taken off guard, such as when they 
are greeted by a passer-by or spoken to from behind. The 
common factor seems to be a high level of situational spon- 
taneity in which the patient does not have time to think 
before speaking. The more patients concentrate on what 
they wish to say, the worse their voices become. Many 
patients report that even the affective content of their 
thoughts and mood fluctuations can alter severity; that 
when thinking pleasant thoughts their voices are better, the 
reverse being true for unpleasant ones. Normal voice during 
singing was one of the earliest observations described and 
contributed to early postulations that the problem was 
psychogenic in origin. It is logical to conclude that spasms 
are linked to volitional, intellectual, or executive speech 
functions and disappear during uninhibited, emotional, 
automatic, nonintellectual speech. Contemporary writings 
suggest symptoms of spasmodic dysphonia result from 
abnormal motor responses to sensory input and laryngeal 
feedback, possibly by disinhibition of laryngeal motor 
responses. 
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Factors Associated with Onset 


The clinician’s ability to uncover factors associated with 
onset of any disorder is contingent upon the patient’s ability 
to remember or willingness to disclose such information. 
Even when patients report events antecedent to or concur- 
rent with onset, there is a danger that the examiner will con- 
clude they are necessarily cause and effect. Also, because the 
onset is often insidious and most patients do not seek help 
immediately, it may be difficult for them to be accurate 
reporters, and they, too, may be trying to associate a cause 
and effect. 

Upper respiratory infections, as discussed elsewhere in 
this book, are commonly reported with the onset of all voice 
disorders. Asked if their disorder had occurred after a cold, 
sore throat, or laryngitis, 35% of patients answered in the 
affirmative, and 17% said it occurred after a “flu-like” illness 
(Aronson, 1979). Izdebski, Dedo, and Boles (1984) found that 
15% of their patients reported nonspecific vocal tract illness 
as well as colds and viruses as being associated with onset. 
What remains unknown is whether these laryngeal and 
respiratory illnesses were true infections that had some phys- 
ical bearing on the disorder or whether they were uncon- 
scious simulations of respiratory infections that provided the 
psychologic rationale for announcing the disorder, as is 
known to occur in psychogenic voice disorders as a group. 

An unavoidable fact, again not necessarily indicative of 
cause and effect but in many cases highly suspicious and 
even compelling, is the co-occurrence of acute or chronic 
psychologic stress. It is possible that the neurochemical 
changes co-occurring with stress contribute to the disorder 
or that the stress of struggles to communicate cause stress. 
In their survey of stress and emotional trauma, Izdebski, 
Dedo, and Boles (1984) found that ~16% of patients thought 
that stress had caused their disorder. Brodnitz (1976) found 
that 40% of his patients had described what he called trigger 
incidents: death of a relative, severe automobile accident in 
which the patient was not physically harmed, witness to 
murder of a relative, divorce of a relative, unremitting work 
stress, marital conflict, and perpetually demanding responsi- 
bility. The term trigger incident is an apt one in any discussion 
of cause and effect, for later in this chapter the distinction 
will be made between stress as a primary psychogenic cause 
and stress as a trigger or precipitant of a latent neurologic 
disorder. Meanwhile, one should resist the temptation to 
conclude that cases are primarily psychogenic merely on the 
grounds that the disorder coincided with upsetting life cir- 
cumstances. 

In a study of 100 spastic dysphonia patients, Aronson 
(1979) found an extraordinarily high percentage of patients 
who had reported a close temporal relationship between 
stress and the onset of their voice disorder. Did spastic dys- 
phonia occur after a sudden emotional event? Forty-five per- 
cent said it had. Had the dysphonia been accompanied by a 
continuous, ongoing stressful situation? Fifty-five percent 
answered affirmatively. One must then ask would similar 
percentages of persons admitting to stress occur in the 
normal-speaking population. 

The following are samples of patient reports of acute, 
emotionally traumatic events shortly after which the dys- 
phonia surfaced: “Auto accident; family death; son got 


divorced; angered at work over a situation I could not han- 
dle; death in family followed by marital, family, and business 
stress; dispute over property; daughter’s marriage; unmar- 
ried daughter became pregnant; brother killed in auto acci- 
dent and husband died of heart attack in close succession.” 
On a frequency of occurrence basis, family illness and death 
rank high as antecedents of adductor spasmodic dysphonia. 

Typical chronic stress situations reported were “hectic 
worried life; unemployment; hospitalizations; pushing self 
too hard; continuous job pressure; under tension because of 
husband's business; having marital difficulties; passed over 
for promotion; responsibility for caring for alcoholic brother; 
responsibility for caring for aged mother; taking care of 
invalid husband and his eventual death; own cancer surgery; 
homosexual son living at home and his objectionable friends 
who come to visit.” 

Many patients have no recollection of physical illness or 
emotional stress at the time of onset of their dysphonias. 
Whereas in some cases they have forgotten, in others they 
may be guarding or repressing sensitive information. In still 
others, no event occurred at the time of onset. An attempt 
will be made in the Etiology section to clarify the implica- 
tions of these variations. 


Psychosocial Effects 


It does not matter if the etiology of adductor spasmodic dys- 
phonia is psychogenic, neurologic, or idiopathic; the voice 
disorder in its own right is a dire threat to the patient’s 
psychologic equilibrium. Self-consciousness, feelings of inad- 
equacy, paranoid ideation, diminished interpersonal commu- 
nication, social withdrawal, alcoholism, depression, and even 
suicidal tendencies are found in a majority of these patients 
to some degree. The intensity of these emotions is roughly 
proportional to the severity and duration of the voice disor- 
der and to the importance of communication in the patient’s 
daily life. Some patients are prime candidates for psychiatric 
and psychologic help strictly on the basis of the voice effects 
alone. These patients prove how considerably important nor- 
mal voice is to the maintenance of a secure self-image and 
how alienated we become with ourselves and society when 
our self-image, epitomized by our voices, is threatened by our 
own and others’ adverse reactions to it. 

Spasmodic dysphonia affects the emotional, social, and 
occupational life of each person differently, depending upon 
each patient’s premorbid personality and life circumstances. 
The extent of incapacitation depends upon the patient’s 
reliance on speech occupationally and socially. Occupational 
effects can be devastating in those whose livelihood is heavily 
dependent upon oral communication, for example, teachers, 
attorneys, physicians, clergy, entertainers—any human 
endeavor in which the personality, ethos, or persuasiveness of 
the speaker is integral to that person’s need to communicate. 
In a survey of the effects of spasmodic dysphonia on job per- 
formance, Izdebski, Dedo, and Boles (1984) found that the 
dysphonia seriously interfered with job performance in 93% of 
men and 77% of women, and, as a result, 26% of men and 37% 
of women were forced to change jobs or to be reclassified, 
usually with adverse financial and emotional consequences. 
Aronson (1979) found that 64% of spasmodic dysphonic 
patients surveyed reported having become socially withdrawn 
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and depressed after developing the condition. Forty-one per- 
cent said it interfered with their earning a living. The follow- 
ing are excerpts from that survey. It is likely that if these stud- 
ies were to be repeated today, the percentages affected would 
be even more dramatic because in the ensuing 25 years, sig- 
nificantly more women have entered all aspects of the profes- 
sional workforce. Representative samples of occupational, 
emotional, and social effects reported by patients in response 
to Aronson’s survey follow. The comments are devastatingly 
telling of the adverse impact of this disorder. 


Occupational Effects 
+ “Itcaused me to retire. I could no longer work like I used to.” 
+ “Thad to be placed on disability pension.” 


+ “In my work, the ability to communicate is essential. My 
voice has resulted in my passing up or not doing some 
things I might have done otherwise for added income, 
like negotiating teacher contracts.” 


“By the time I reached retirement age I was ineffective on 
the telephone, inadequate in communicating with 
employees or in participating in management sessions. I 
had to quit.” 


+ “I live in a secluded world and don’t apply the potential I 
know I have. I don’t work toward a better life.” 


+ “Ihave missed several promotions due to my voice situa- 
tion.” 


+ “I have kept my job as a college professor by using ‘self- 
paced’ nonlecture methods of teaching. But, this requires 
more time, and my productivity has dropped.” 


+ “Mine is a small business and I cannot find help. I have to 
wait on customers and communicate with them. I become 
very worn out and tired, angry at myself and depressed.” 


+ “My job as a consultant consists of sales pitches directly, 
and due to my voice problem, I haven't been able to com- 
municate properly with customers. I work on a commis- 
sion basis, and sales have dropped.” 


+ “How can clients have confidence in me as their attorney 
with a voice like this? Some of them look at me strangely, 
and when there is any office noise they can’t hear me. 
Trial work is impossible, now. I let my partner do that.” 


Social Effects 


+ “I no longer want to go to social events or meet with 
people.” 


+ “Itry to avoid talking to people. I avoid meetings of any 
kind; going to people’s houses or having guests at home. 
It’s embarrassing.” 


“I cannot speak comfortably in crowded rooms, automo- 
biles, buses, or airplanes.” 


+ “I have been hesitant and have withdrawn from taking 
office in several clubs and fraternal activities where use of 
the voice is involved.” 


+ “People shy away because they cannot hear, and I, in turn, 
do likewise, because I am aware of their discomfort.” 


+ “Lam unable to carry on a conversation without straining 
my throat and am silent most of the time, even at home. 
When I am with other people I listen and seldom enter 
the conversation.” 


+ “Its embarrassing to talk, because people stop and stare. 
Children and teenage boys ridicule me.” 


“I can’t visit as I would like. Sometimes my experiences 
have been knowledgeable, and I would like to express 
them, but I cannot, and it appears to others that I am stu- 
pid.” 


+ “Lused to sing in the church choir. Now my singing is hor- 
rid.” 


+ “I'm not completely withdrawn but am more selective. I 
try to avoid one-on-one social events and tend more to 
group events. Also, I try to socialize with those who have 
heard me and understand my problem.” 


Emotional Effects 


+ “I try to talk and words won’t come out. | get frustrated 
and feel like crying or just getting away from people. I get 
frustrated, then angry, but it’s hard to get angry when you 
can’t raise your voice.” 


+ “I really freaked out and was put in the hospital for two 
weeks under sedation. After I came home it was several 
months before I had any interest in living. I even contem- 
plated suicide, but I felt I couldn’t do that to my family.” 


+ “Lack of vocal confidence is a very emotional experience 
when you have talked normal all of your life and then 
your voice becomes less proficient; you tend to become 
depressed. The question ‘Why?’ is always present.” 


+ “Certainly, any person who enjoys life and people would 
be depressed. It’s hell not being able to communicate.” 


+ “Ihave been depressed much of my life; always had suici- 
dal tendencies; couldn’t communicate with the world, 
and children especially.” 


+ “Black moods. Hopelessness. Helplessness. Haplessness.” 


+ “I feel like my life is being wasted and that I’m being a 
burden to my husband.” 


+ “It’s hard to put into words, but I feel shut away from the 
world.” 


+ “My self-concept has become much more negative. I 
withdraw socially and become depressed when I think of 
how well I used to do things, like meet people and make 
friends. It also had an effect on my marriage. I feel it was 
the cause of our separation.” 


+ “I began to feel very sorry for myself. I felt I was being 
punished for something I had done wrong in life. My mar- 
riage problems grew worse as I focused more on myself. I 
was very unsure of everything.” 


The preceding testimonies of the destructive life effects of 
adductor spasmodic dysphonia do not mean every patient 
who acquires this disorder is destined to suffer in this way. 
As stated earlier, this is not a homogenous disorder; many 
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patients after a time learn to live with the disorder. They 
adopt a philosophic point of view, but usually these are not 
the more severe cases. These patients are often the more sta- 
ble and self-sufficient ones who do not rely heavily on 
speech as a means of emotional expression or social satisfac- 
tion and who do not use speech as a primary means of earn- 
ing a living. 


Etiology 


Adductor spasmodic dysphonia should not be regarded as a 
single disorder or disease entity but as a voice sign of any one 
of several different causes. These causes should be specified 
whenever possible and may be neurologic, psychogenic, or 
abnormal motor patterning. To specify cause, a thorough 
clinical study of each patient is necessary. 


Psychogenic Adductor Spasmodic Dysphonia 


Whether or not a sign or symptom is psychogenic is usually 
based on indirect, circumstantial, and impressionistic data. 
The credibility of evidence for psychogenicity as a primary 
cause of any disorder is proportional to the time devoted to 
the psychosocial interview; skill, wisdom, and judgment of 
the clinician; and the honesty, insight, and verbal ability of 
the patient. It is not difficult to find lives riddled with con- 
flict, strife, disappointment, tragedy, insecurity, rage, hostil- 
ity, suspiciousness, and other signs of emotional turmoil. It 
would be too simple, and erroneous, to assign a speech com- 
plaint to any change of job or to an argument with a spouse. 
However, when the following relationships can be demon- 
strated, a strong likelihood that a voice disorder is psy- 
chogenic must be considered: 


1. The onset of the voice disorder and the stressful life event 
occur in reasonable proximity to one another. 


2. Verbal communication is important to the patient in the 
conflict, however trivial it might appear to you. 


3. During disclosure of delicate or confidential information 
to the clinician, the voice becomes appreciably improved 
or normal. 


4. During trial symptomatic voice therapy, the voice 
improves appreciably or becomes normal. 


5. When the conflict is moderated, or the patient takes a 
vacation, the voice improves considerably or becomes 
normal. 


6. The patient reports multiple sensory complaints and 
somatizations. 


7. Obvious psychiatric disturbances. 


Because numbers 1, 3, 4, and 5 are common clinical fea- 
tures among neurologic dystonias (Brin, Fahn, Blitzer, et al, 
1992), the clinician’s diagnostic task is difficult. In fact, diag- 
nosis of psychogenicity might not be possible until the clini- 
cian has begun trial therapy. Few people would argue against 
the diagnosis of psychogenic adductor spasmodic dysphonia 
if, during the course of the psychosocial interview or reduc- 


tion of musculoskeletal tension, the voice returned to nor- 
mal and remained that way for a reasonable duration. 
Organic, neurologic voice disorders do not fluctuate to such 
extremes or disappear completely though on a short-term 
basis non-psychogenic patients with spasmodic dysphonia 
may be able to modify their voices. 


Adductor Spasmodic Dysphonia of Conversion Disorder 
A review of the early literature indicates that the majority of 
clinicians perceived their patients to be psychoneurotic and 
thought their strained, groaning, effortful voices were 
directly caused by and were manifestations of such psy- 
choneurosis. Specifically, the spastic voice was regarded as a 
conversion disorder (Bloch, 1965; Kiml, 1963). Contempo- 
rary experience sustains the validity of this initial etiologic 
conceptualization for some patients. The evidence, then and 
now, is based on the logical relationship between the dys- 
phonia and certain life events surrounding its development 
and the ease and manner in which the dysphonia is allevi- 
ated during the verbalization of emotional conflict and by 
symptomatic voice therapy. A review of the theory of con- 
version disorder in Chapter 10 and references given there 
will remind the student of the strong and symbolic relation- 
ship between unresolved conflict and abnormal voice. 

Spasmodic dysphonia can be a sign of conversion disorder 
no different from conversion aphonia or nonorganic dyspho- 
nia as of other voice descriptions. The spasmodic dysphonia 
serves to erect a barricade against the dangers of self- 
revelation. The squeezed, groaning, effortful voice commu- 
nicates suppression of intense aggression and fear of verbal 
expression. Heaver (1959), one of a handful of psychiatrists, 
convincingly established the conversion type. The following 
study by Heaver, and other cases, can be found in Bloch 
(1965), and these should be read for their psychologic correl- 
ative evidence. The student should always ask in contem- 
plating single-case and group research studies here and 
elsewhere, “What is the likelihood that these stories are rea- 
sonably, symbolically, and temporally related to the voice 
disorder? Or, to what extent are these life storms and the 
dysphonia merely coincidence?” The following is a synopsis 
of Heaver’s classic case of conversion adductor spasmodic 
dysphonia. 


Case Study 6.1 


This is the case of a 56-year-old woman who is married to 
a pharmacist. She manifested envy for her husband’s sev- 
eral academic degrees and was driven to engage in out- 
side activities. Four years before admission, the patient 
gradually became completely aphonic. It was necessary 
for her daughter to learn lip reading. Her aphonia, the 
patient said, followed a series of “severe emotional 
shocks.” Their son was in battle in the South Pacific, the 
husband for a time was thought to have throat cancer, 
she had a painful thrombophlebitis, and was concerned 
over her son’s impending divorce. This last was connected 
very closely to her voice symptoms. When the divorce 
question arose she was no longer permitted to see her 
grandchildren. The son was doing poorly economically. 
Her daughter broke off two successive engagements. 


The ear, nose, and throat examination showed a larynx 
of normal size with injected mucosa. The mobility of the 
congested cords appeared spastic and sphincter-like. 
During phonation, there was congestion of the external 
jugular veins. Vocal analysis revealed spastic respiration, 
low pitch, choked timbre, rigid inflection, hoarse quality, 
weak volume, a limited range, and spastic vocal attacks. It 
was noted that the singing voice was not affected. Diag- 
nosis: typical spastic dysphonia with marked laryngeal 
sphincter action. 

The initial Rorschach study revealed a personality struc- 
ture characterized by much energy and strong self-will, a 
need to maintain control of others and to exert her influ- 
ence on all about her. Marked childish narcissism was cou- 
pled with general negativism, aggression, and hostility. 
Strong self-doubts and concern over self-image of failing 
sexual attraction. In group therapy she sat apart, inter- 
rupted others with comments, and seemed to enjoy her- 
self. 

Minor degree of improvement occurred with voice ther- 
apy. She was not cooperative regarding the advice received 
during psychotherapy. Her voice ran into total spastic apho- 
nia and even intermittent mutism. It was recommended 
that voice therapy be combined with dramatic and sugges- 
tive but non-probing psychotherapy. Gradually, her 
mutism returned to the previous spastic dysphonia. She 
expressed her underlying aggressive and hostile feelings 
toward her family, friends, and medical staff covertly dis- 
guised by facetious and joking attitudes. She maintained 
evasiveness and resistance to any effort, no matter how 
superficially leveled, to analyze her attitudes and voice 
problem. Her behavior is erratic and histrionic; her urge to 
dramatize her feelings is strong, as it is to attract attention. 
Whenever she neared discussion of emotionally charged 
personal material, she conjured some rationalization to dis- 
continue her visits temporarily. Her most extended period 
of absence of spastic voice symptoms occurred during the 
second year of therapy with the second therapist. For 
almost three weeks, she spoke in her previous normal fash- 
ion. With her mother’s death, her spastic syndrome rapidly 
returned. Since then, she is sometimes a little improved, 
sometimes notably worse, and on a few transitory occa- 
sions she was free of dysphonia. 
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P: Yes. Nine years ago | had a little boy, he was seven 
months old, he is nine now, and | would just have flashes 
of something and.... 

C: Similar to what you have now? 

P: Yes, it was frightening whatever it was; | couldn’t 
seem to control it. It only happened a few times and | went 
on a vacation and it came back. 

C: Same kind of voice that you had seven and a half 
years ago? 

P: Well, | can’t really remember it. 

C: But the present problem began in earnest seven and 
a half years ago. 

P: Yes. 

C: What did you notice first? Was it the same kind of 
voice that you have now, or was it something different? 

P:Oh... well, | can’t really remember it. It is just hard to 
speak. Sometimes shakiness. | had left my husband. | was 
living with my daughter who was going to college, and 
I’ve had a lot of trauma the last nine years. 

C: Do you remember exactly or approximately when the 
voice trouble began? Was it while you were pregnant or 
after the birth of your child? 

P: While | was pregnant, and then that summer all of a 
sudden | had gone back on the ranch and | couldn’t speak 
at all, it didn’t really happen suddenly but probably over a 
period of a week. 

C: And has the voice trouble been present ever since? 

P: More or less. Sometimes it does get better. 

C: Has it ever disappeared completely? 

P: No. Only three times when | drank way more than | 
should have. Usually drinking makes it worse. 

C: Under what conditions will it be better? 

P: Well, | can’t really say. 

C: Under what conditions will it be worse? 

P: Well, when I’m under extreme stress, and talking 
about it makes it worse. 

C: And what previous help have you sought for the 
problem? 

P: First, seven and a half years ago, | went to a throat 
specialist twice. The first time he gave me cortisone and 
| went home and took it faithfully, and then after two days 
| couldn’t talk at all. My husband had to call him back, so | 
went back and he gave me Valium, and that is the last 
time | ever saw him. 


Hostility, verging on rage, inability to verbalize aggression, C: So, except for a brief episode or two about nine years 
and entrapment in life circumstances with no apparent ago, the voice trouble really started about seven years ago. 
means of escape, so common in the conversion reaction type P: Yes. 
of adductor spasmodic dysphonia, is further exemplified in C: Now, was that period in your life a significant change 
the following case study of a 45-year-old woman who had over the way it was before that time? 
been diagnosed as having adductor spasmodic dysphonia P: Well, | guess, because | have a troubled marriage, 
several years before this interview: that got worse. It has never gotten better. | had left my 
husband and after my baby was born | had gone back. It 
was just a real bad time. 

C: At the present time what would you consider the 
major problem in your life? 

Clinician: When did you first start having trouble with P: My marriage. 
your voice? C: Would you describe that in as much detail as you can? 

Patient: Well, seven and a half years ago when | was P: Well, it’s bad. | don’t see any way out, because | have 
pregnant with my little girl. these two perfectly delightful children who are happy 

C: Had you ever had any such trouble before? and secure and they have a wonderful home in a small 


Case Study 6.2 
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town. Their life is just ideal, and | just can’t ruin it for 
them. 

C: Why do you say that there is no way out? 

P: Well, I’ve been married for 30 years, and now | was 
thinking about it last night, for the last year | think I’ve just 
had a few emotions. My love for my children and then all 
the rest of them are just completely negative. All hate, 
which I know is the worst thing that you can . . . it is very 
devastating. 

C: What do you mean hate? 

P: Well, | just don’t like my husband. 

C: Why not? 

P: Oh, it has gone on for so long. When | was 18 | thought 
he was great; he is seven years older than | am, and | just 
kind of married him out of, just for security. | was kind of 
the first hippie, or | feel like | was, and | thought he was 
nice, but now I’ve grown up and | know what he is like, and 
I’m sure, well, we just both tried to destroy each other. | 
feel as though he has tried to destroy me, and | feel like I’ve 
given him way, way more, but he is very, very unyielding 
and completely unable to . . . | work very, very hard and 
have tried to suppress all kinds of emotions or feelings. The 
only time | really let go is in a screaming fight. 

C: How is your voice during those fights? 

P: Well, it comes out pretty good. That’s why he says 
“there is nothing wrong with your voice now.” Of course, 
it is still very forced, but sometimes | feel like my voice is a 
substitute for crying, because | have quit crying. | decided 
a long time ago that that is really useless to do. Some- 
times | think it was good for me to cry, and now | can’t. | 
had breast cancer a year ago last February and had a mas- 
tectomy, and my husband knew that | had a lump for four 
to six weeks. When | discovered it | knew immediately that 
it was bad, and | called him into the bathroom and he said, 
“Oh | knew you had that four to six weeks ago, and | said 
‘why didn’t you tell me.’ “ And he had come up behind me 
and grabbed me which makes me want to kill, and | guess 
| elbowed him or he said, “Oh, you’re so mean” or some- 
thing, and now I’m really very bitter about that. | keep 
telling him that | can’t forgive him for it, but he is the kind 
of person that he can’t even yield enough to say, well | did 
make a mistake, or | don’t know why | did that. He can’t 
even do that. He just says, “Well, if you’re so stupid that 
you can’t find your own lump” more or less to that effect. 
Oh this is so painful. 

C: It was four weeks before. 

P: He said four to six weeks. 

C: So when was that? 

P: Well, | had my mastectomy a year ago, last February. 

C: So that certainly didn’t improve the situation. 

P: No. Not my marital situation. | don’t think it affected 
my voice any. 

C: But it reinforced your feelings toward your husband. 

P: | don’t understand. | do understand him in a way, 
because | know the rest of his family. 

C: But the marriage was in trouble before your voice 
trouble started, wasn’t it? 

P: Yes. 

C: And yet you had no trouble with your voice seven 
years ago. 





P: No. I love to talk. | used to, but, of course, now | don’t. 

C: Well, what do you see for yourself in the future? 

P: Well, about the only solution, he says that he will go 
to a marriage counselor. He is such a contradictor. He has 
gotten totally dependent on me. He annoys me because 
he thinks he has to talk about everything. It’s like some- 
one thinking aloud. He just talks me to death telling me 
what he is going to do. If he is going to go up the hill and 
feed the horse he has to tell me. He feels threatened, | 
know, about all kinds of things. 

C: Like what? 

P: Well, | know that he thinks that | am independent and 
strong, and I’m sure that he feels threatened by that, and 
when | do things on my own | know that he feels threat- 
ened by that. He is terribly jealous. Terribly, and we don’t 
like the same people, or he doesn’t fit in with the people 
that | like. Of course, in the town | live in| don’t have... | 
like where | live. | love my home, and it would be very hard 
to leave, but | don’t really have any close friends that | 
relate to. | have the kind of friends that | have to use poor 
grammar around every once in a while, or they would 
think that I’m, you know, trying to be something else. | 
hate to say it without sounding snobbish, because | really 
am fond of the people that are good to me, but some- 
times I’d like to go to the university and take a course in 
creative writing and an art course. | do a lot of painting, | 
do some rosemaling, and | feel deprived in that way. | 
really don’t feel like a martyr for my children, because they 
are the most important thing that | have. 

C: So you have said, then, that your husband has felt 
jealous and threatened and at the same time you'd like to 
go back to school and take some courses. 

P: You asked me what | saw ahead. | told you what I’d 
like to see. This is what | do see ahead. He did say that he 
would see a marriage counselor. | don’t know what they 
are like, but | know we have to work something out; we 
have to. We just can’t go on like we have this summer. But 
last summer he changed. | think possibly he could have 
been having affairs, but that doesn’t bother me all that 
much really, but it has been bothering me that he seems 
to be regressing. | feel as though he is growing old without 
ever growing up. He doesn’t have any interests; he is not 
doing anything in particular. He just likes to hang around 
the house. 

C: Is that threatening to you? 

P: Well, it annoys me, because he doesn’t have any, you 
know, | think he should have interests. The people in his 
family have an idea that the older you get the easier you 
should take it. You know, you should just sort of roll into a 
steady decline. And my life isn’t going to be long enough 
for me to do all the things | want to do, and | just can’t 
stand people like that. And he doesn’t like it; he hates my 
reading; he doesn’t like books. He is very scornful of peo- 
ple .. . and he never reads. He does better than he did 
when | first married him, because he only read comic 
books then. | should have realized then that a 25-year-old 
man that read comic books was a little bit . . . But he is 
intelligent. 

C: Why do you say that you are not going to live long 
enough? 


P: Well, I mean I have so many things | want to do. | don’t 
have any idea that I’m not going to last very long, because 
| promised my kids that l'Il live forever. No, | don’t want you 
to think I’m making a death wish or anything. No. 

C: So you and he think something needs to be done? 

P: It has to be. We can’t go on this way, because finally 
I’ve given up keeping a pretense by trying to move out of 
his bedroom. Up until a month ago | still tried to be pleas- 
ant by talking to him as much as | could, but finally he 
wanted to drag me to an alumni thing at his old high 
school, and | just couldn’t face it, especially not with my 
voice being bad. People | don’t care for and | just couldn’t 
face it. And we had just had a disastrous trip. 

C: Disastrous? 

P: Well, it was just disastrous. We hated it. | hated him. 
We went to see my daughter, and she was having difficul- 
ties with her husband. Her husband is quite a bit older 
than she is, and he is the very same kind of person. They 
both have the very same ideas about what a woman or a 
wife should be and do, and what a man should be and do, 
and how it should be, and the two of them were buddy- 
buddy. So when he wanted me to go on this trip, | knew 
everything was going to hit the fan, and | knew it wasn’t 
very smart of me, but I’m so tired of pretending. | just said 
| made up my mind when | got back that I’m not even 
going to ride in town with him let alone go on a weekend 
trip. And of course, you can guess what all that led to. 

C: What? 

P: Well, just terrible, terrible things that | said to him and 
he said to me. But we don’t fight in front of the kids. No 
one but my closest friends know that I’m anything but 
Miss Mary Sunshine. My voice. This is the worst it has 
been, just talking about these things. It is very difficult, 
but usually when I go into town and buy groceries and 
take the kids to the swimming pool | can get by, but my 
whole life is a pretense except when I’m with my kids. 

C: Again, your whole life is a pretense. 

P: Well, | mean because | am keeping up a front, you 
know, you don’t go around telling everybody you’re not 


happy. 

C: Have you been getting unhappier as the years have 
gone by? 

P: Yes, well, we’ve hit a new low. 

C: Why? 


P: Because we're fighting. 

C: You are fighting more this summer than ever before 
you think? 

P: Yes. 

C: Why do you think that is so? 

P: Well, because | finally just told him 1, well, something 
has happened to me, because | was always able to try and 
come around and say, Oh he’s not such a bad guy, and | 
guess l'Il be able to stick it out till, you know.... 

C: Could you summarize for me what it is about him 
that you find so objectionable? 

P: | feel that he is unyielding and he is a chauvinist. He 
just can’t accept letting me just be. 

C: What is the nature of this relationship that makes you 
feel as antagonistic toward him as you do? What is there 
about him? 
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C: He has a cruel streak. Very cruel. 

P: What do you mean? 

C: Oh, things that he says to me. Of course, | say cruel 
things to him, too. But | think he hits deeper than | do, | 
don’t know. | hated my father. | just completely disassoci- 
ated myself from my mother and sisters. Oh I didn’t want 
to get into that. | don’t know .. . Oh, | said | hated my 
father. Well he reminds me of my father. 

C: Your husband has many of your father’s traits? 

P: Yes, in fact most of them | think. But he certainly 
would have been a complete opposite as far as | can see 
when | married him. 

C: You didn’t see that other side to him? 

P: No. | didn’t deliberately choose someone who was 
like my father. 

C: What has the sexual part of the marriage been like for 
the past seven years? 

P: Oh, pretty fair. Of course, there hasn’t been any sex 
for the past probably four months. And for me | don’t 
want him to touch me, ever. 

C: Up until that time you were having a normal sexual 
relationship? 

P: | would say that it has never been enough for him, 
which turns me off. 

C: But, for the past four months there has been no sex- 
ual activity at all. 

P: No. 

C: What caused that? 

P: Well, | don’t know. | just plain moved out of the bed- 
room. 

C: Why did you do that? 

P: Because . . . he just repels me. 

C: Why now? 

P: Well, | don’t like him. If you don’t respect . . . you cer- 
tainly have to like someone you have sex with and I don’t 
like him. 

C: Have you told him that? 

P: Well, I think it is obvious to him that I don’t like him. 

C: What was his reaction to this move on your part? 

P: Well, his idea of a confession is to, after we’ve had a big 
fight, never to admit anything but to come up and say “Can 
| mix you one?” “Can I get you a cup of coffee?” And that is 
supposed to be his wanting to make up. Everything is just 
supposed to be forgotten. He has made these overtures to 
me wanting to just forget the whole thing, and he hasn’t 
said a word about my moving out of the bedroom. We 
don’t even talk about sex any more. It used to be a big has- 
sle but there, it’s just, | can’t give you any answers without 
talking for an hour on each thing. | can hardly do it. [Pause] 

C: We are not completely sure that this voice disorder is 
caused by marital problems, but | think in this case there is 
a good chance that it is. 

P: Except that there have been times when I’ve been 
extremely happy, at least thought | was, and relaxed and it 
has still been bad. 

C: At any rate, before saying it is or it isn’t, | think we are 
going to need a psychiatric opinion about this. | have a col- 
league in psychiatry who has been seeing patients of mine 
who have voice disorders for several years. And | would 
like him to see you if that is all right with you. 
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P: Yes. 

C: Even if you had no voice problem, it would still be def- 
initely indicated for you to see somebody, as I’m sure you 
have already concluded that yourself. 

P: | know. 

C: “Something has got to be done”; those are your 
words and there is no time like the present, in my opinion. 
What is yours? 

P: The same. That’s why I’m here. Oh, something else. 
Did | mention the fact that I’m sure that he has never 
wanted me to have, to do anything about my voice? I’m 
sure of that. 

C: What makes you say that? 

P: Well, because he has never encouraged me to, or 
never said, “Why don’twe...,” he says he loves me but he 
has never said “Why don’t we take you somewhere and 
see what we can do?” | had a few sessions with a speech 
pathologist last summer, and it kind of helped because 
just, temporarily, | was enjoying the idea that | was doing 
something about it, but he never asked me a question 
about it. Or he didn’t say “You’re better,” or “How are you 
doing?” or “What are they doing with you?” or anything. 
Not one word. And when | wanted to come here my hus- 
band informed me that he wasn’t going to pay for any trip. 
So | borrowed the money to come here. 


In a study of the possible psychiatric background in patients 
with adductor spasmodic dysphonia, Aronson, Brown, Litin, 
et al. (1968a) submitted 29 of 34 patients to psychiatric inter- 
views averaging 2 hours in duration. In 18 (62%), the psychia- 
trists judged the histories and personality characteristics to be 
indicative of psychoneurosis; 5 had significant psychiatric 
problems and 13 presented with equivocal psychiatric find- 
ings. Eleven patients were psychiatrically normal. 

As a group, the 62% who had emotional problems of vary- 
ing degrees—regardless of whether or not these problems 
can be considered causally related to spastic dysphonia— 
possessed the following traits: (1) They were compulsive 
and perfectionistic, not very tolerant of error in others or 
themselves, and had to do things “right.” (2) They had a ten- 
dency to suppress their anger resulting from their relation- 
ships with other people. (3) They had lifelong tendencies 
toward verbal repression rather than free expression of 
ideas. (Not all patients had all three traits.) The psychiatrists 
agreed that: 


Generally, these patients were rigid, conscientious people 
who took their responsibilities very seriously. They tended 
to be worriers and perfectionistic. In addition, and com- 
mon to many psychiatric conditions, their means of deal- 
ing with their feelings of anger, resentment, and irritation 
were faulty and inefficient; instead of expressing their 
angry feelings in a socially acceptable way, they usually 
would choose to keep their anger to themselves. With dis- 
concerting frequency, these patients described themselves 
as “the quiet ones” in their families, from childhood on. One 
or both parents or a sibling was often the dominant one as 
far as verbal expression was concerned and these patients 
had little opportunity or need to participate verbally 
(Aronson, Brown, Litin, et al., 1968a, p. 215). 


A review of the Aronson, Brown, Litin, et al. (1968a) study 
shows reasonable psychogenic certainty in 5 of the 29 
patients who had undergone psychiatric examination. How- 
ever, 13 produced equivocal information and 11 were 
thought to be free of any evidence of psychiatric disorder. 
Only in retrospect do these discrepancies become explain- 
able. The study had been done when its authors were just 
beginning to become aware of neurologic etiology. To be 
noted is that 71% of the patients in the study had voice 
tremor. Probably the authors had reported on the psycho- 
logic results of a heterogeneous group of adductor spas- 
modic dysphonic patients whom they had regarded as 
homogeneous. The neurologic patients, in other words, were 
mixed in with the psychogenic types. 

Brodnitz (1976) describes 53 of his spastic dysphonia 
patients who volunteered, under careful probing during the 
initial interview, what they considered to be the events that 
triggered the development of their disorders. Examples are 
death of a near relative, 13 cases (five from laryngeal cancer); 
severe automobile accident without personal injury, eight 
cases; climactic marital crisis, eight cases (one starting sud- 
denly in a divorce court after verbal abuse by the opposing 
attorney); laryngeal surgery for removal of benign lesions, 
five cases. One woman became dysphonic after her husband 
threatened to kill her, another after her husband was mur- 
dered, a third after her brother was shot to death ina holdup, 
a fourth after observing her child’s first seizure. 


Case Study 6.3 


A young psychiatrist was told that his training in analysis 
was complete. A few weeks after his cutting of the analyt- 
ical umbilical cord, he developed spastic dysphonia. He 
discussed it with his analyst who agreed to take him back 
for three more months. Immediately his voice became 
normal only to revert to the dysphonia when the analyst 
told him at the end of these three months that he should 
now be able to stand on his own two feet (Brodnitz, 1976, 
pp. 212-213). 

Aside from these 53 patients who could pinpoint the 
onset of the disorder, there were 28 who exhibited many 
of the characteristics of severe neuroses. Eight of them 
had already had long periods of psychiatric treatment 
before they developed spastic dysphonia. This leaves only 
about one third of the cases reported in this paper where 
no obvious psychogenic background could be found. This 
is not an unusual situation in psychosomatic disorders, 
particularly in manifestations of conversion hysteria. In a 
summary of 562 spasmodic dysphonia patients seen at a 
neurological institute, Brin, Fahn, Blitzer, and Ramig 
(1992) report that only 17%, or 98 patients, had a second- 
ary dystonia of some type, but they do not specify what 
fraction of the 17% is psychogenic. (They consider psy- 
chogenic dystonia a form of secondary dystonia because 
the phenomenology is dystonic and the etiology is psychi- 
atric.) Brin and his colleagues do, however, state “We sus- 
pect that psychogenic spasmodic dysphonia is very rare, 
and indeed have never seen a confirmed case. . . . we sus- 
pect that we have not seen these patients because they 
are treated by speech language pathologists and are not 
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referred for evaluation”. “Of course, everybody who has 
had frequent contact with cases of spastic dysphonia has 
entertained the suspicion that an organic neurologic fac- 
tor may be the etiological basis of the more severe forms 
of the disorder. Possibly, future research with more 
refined methods may demonstrate such a connection in 
some cases” (Brodnitz, 1976, p. 213). Indeed, during the 
past two decades, numerous histologic, neurophysio- 
logic, magnetic resonance imaging (MRI), functional 
magnetic resonance imaging (fMRI), and positron emis- 
sion tomography (PET) studies have been completed in 
an attempt to determine the neurologic locus of spas- 
modic dysphonia. Thus far, they have failed to locate con- 
sistent abnormalities within laryngeal nerves (Carlson, 
Izdebski, Dahlqvist, et al., 1987; Ravits, Aronson, 
Desanto, et al., 1979) or any particular brain structure or 
region (Aronson and Lagerlund, 1991; Devous, Pool, and, 
Finitzo, 1990). In their scholarly review of historical 
approaches to the treatment of adductor spasmodic dys- 
phonia (ADSD), Pearson and Sapienza (2003) state that 
whereas ADSD is a voice disorder of uncertain origin, the 
origin is likely to be neurogenic and that current theory 
views ADSD as a problem of motor circuitry rather than a 
byproduct of damage to any one cortical or subcortical 
area. 


Adductor Spasmodic Dysphonia of Musculoskeletal Ten- 
sion Reaction Some patients develop the voice of adductor 
spasmodic dysphonia on the basis of a relatively simple 
muscle tension reaction also called vocal hyperfunction. 
Although these dysphonias are technically psychogenic, 
caused by heightened anxiety, tension, or depression, they 
are not the more serious conversion disorder types. 

Increased musculoskeletal tension as a cause of the voice 
symptoms of adductor spasmodic dysphonia has been called 
the tension-fatigue syndrome, the components of which are 
hoarseness, often of the strained variety, vocal fatigue, poor 
respiratory support, and pain on phonation. The dysphonia 
of musculoskeletal tension or the tension-fatigue syndrome 
can have all of the earmarks of adductor spasmodic dyspho- 
nia. This category of voice disorder is highly amenable to 
symptomatic voice therapy (Koufman and Blalock, 1988; 
Roy, Ford, and Bless, 1996). 


Neurologic Adductor Spasmodic Dysphonia 


This section endeavors to demonstrate that strained, stac- 
cato, effortful voices within the perceptual definition of 
adductor spasmodic dysphonia can be caused by, and are 
often signs of, two major categories of neurologic disease or 
syndromes and their subsidiaries: 
1. Organic (essential) tremor 
2. Dystonia 

a. Meige’s syndrome 

b. Spasmodic torticollis 


c. Mixed dystonia-tremor 


The following preliminary points about these syndromes 
should be noted: 


+ The dysphonia perceived during contextual speech is 
superficially indistinguishable from the psychogenic 
forms. Unless neurologic signs in addition to the dyspho- 
nia are present, and often either they are not or they are 
so subtle as to go unnoticed, it is almost impossible to tell 
the neurologic from the psychogenic forms. 


+ The above syndromes are established organic disorders 
recognized by academic and clinical neurologists. Addi- 
tionally, there are recognized nonlaryngeal focal dystonias 
that include blepharospasm, oral mandibular dystonia, 
and writer’s cramp, which similar to spasmodic dysphonia 
are uncontrolled and abnormal muscle contractions (Lud- 
low, 2000). 


@ They are caused by a lesion or lesions within the 
extrapyramidal system. 


+ The motor signs of these syndromes do not express them- 
selves identically in all patients, who differ from one 
another according to how much of the body is involved, 
the range extending from a single muscle group to the 
entire body; the onset of the disease, which may begin in 
one organ or structure and spread to others, or, the dis- 
ease may remain confined to a single organ; severity, 
which may range from threshold to severe levels; if the 
disease affects the respiratory, phonatory, resonatory, and 
articulatory musculature, in which case the result is a 
hyperkinetic dysarthria of which strained, effortful, or 
staccato voice is but one component; and the site of the 
disease. When the disease expresses itself solely or pre- 
dominately in the larynx, the shift of perceptual attention 
to the voice induces the clinician also to shift the termi- 
nology to spasmodic dysphonia. Technically, then, the 
adductor dysphonia of these neurologic syndromes are 
dysarthrias or components thereof. 


+ The unusualness and often bizarreness of the movements 
in these syndromes routinely lead less experienced clini- 
cians of all specialties to diagnose them incorrectly as 
psychogenic and to rationalize the unusual movements, 
for example, tic-like contractions, facial grimaces, and 
blepharospasm (squinting and frequent eye blinking), as 
secondary psychogenic struggle reactions when, in fact, 
they are signs of the disease. 


+ Confusing and misleading to clinicians unless they have 
been forewarned is that neurologic movement disorders 
are often forced into the open for the first time by acute 
emotional trauma or chronic stress and, once in evidence, 
usually worsen with the pressures of daily life or fatigue 
and improve when the patient is relaxed. Emotional 
stress as the trigger of organic disease, well known in gen- 
eral illness, is common throughout the histories of 
patients who develop many different kinds of neurologic 
diseases. The idea that sudden shock or protracted aggra- 
vation can produce irreversible structural or neurochem- 
ical changes in the nervous system may tax credibility, 
but experienced neurologists who have witnessed this 
correlation unreservedly support this concept. 
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A. Organic Voice 
Tremor 


Figure 6.4 Oscillographic tracings compar- 
ing organic voice tremor without voice arrest 
and with voice arrests during vowel prolonga- 
tion. (A) Organic voice tremor without voice 
arrests. (B) Adductor spasmodic dysphonia of 
organic voice tremor. Note the rhythmic voice 





B. Adductor Spastic 
Dysphonia of Organic 
Voice Tremor 








AACR TA N 


Adductor Spasmodic Dysphonia of Organic (Essential) 
Voice Tremor Noted in Chapter 9 are several important 
points. (1) The quavering voice called organic or essential 
voice tremor is often a component of the essential tremor 
syndrome. (2) The voice tremor can involve the larynx only. 
(3) The rate of the voice tremor ranges from 5 to 12 Hz. (4) 
Onset can be sudden or gradual. (5) The voice tremor can be 
best demonstrated during vowel prolongation. (6) The per- 
ceived tremor is produced by alternating adduction and 
abduction of the true and, often, false vocal folds, which may 
be synchronous with vertical oscillations of the larynx. (7) 
The perceived tremor may result from involuntary rhythmic 
movements of the intrinsic or extrinsic laryngeal muscles or 
pharynx. (8) The voice tremor can be smooth (Fig. 6.4A) or it 
can be interrupted by hoarse voice arrests. (9) When organic 
voice tremor becomes sufficiently severe, the range or 
amplitude of the vocal fold adduction is so extensive that 
they momentarily meet in the midline and seal the glottis, 
producing a strained or staccato voice arrest. Repeated voice 
arrests cause the voice to lose its qualities of tremor and take 
on those of adductor spasmodic dysphonia (Fig. 6.4B). (10) 
Clinically, adductor spasmodic dysphonia of organic tremor 
can be confirmed by rhythmic, semirhythmic, or irregular 
voice arrests during vowel /a/ prolongations that may or may 
not be separated by several cycles of voice tremor! or by 
associated tremor of the head, lips, mandible, tongue, velum, 
pharynx, and thorax during phonation and of the upper 
extremities. The differential diagnosis between spasmodic 
dysphonia due to essential tremor versus that due to a dys- 
tonic tremor can be difficult (Brin, et al., 1992). 

The evidence linking adductor spasmodic dysphonia to 
essential tremor follows. 


Spastic dysphonia in two patients with heredofamilial 
tremor was reported by Critchley (1939, et al., 1949). 


1In very severe cases, the compensatory spasms may be sufficiently intense 
to produce a continuously strained voice rather than tremor on vowel pro- 
longation. Raising the pitch level may enable voice tremor to break through 
when produced on an extended vowel. However, these corroborative signs 
may be absent or so mild as to go unnoticed by all except those trained to 
recognize them. 
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arrests. (From Aronson, A.E., Hartman, D.E. 
[1981]. Adductor spasmodic dysphonia as a 
sign of essential [voice] tremor. J Speech Hear 
Disord 46, 52-58.) 
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Voice arrest cycle 


+ In a neurologic study of 10 spastic dysphonia patients, all 
10 had scattered neurologic signs. One of the patients had 
heredofamilial tremor involving the head and both upper 
extremities (Robe, Brumlik, and Moore, 1960). 


+ A neurologic and psychiatric study of 31 patients dis- 
closed regular voice tremor during vowel prolongation in 
71% of patients. Six patients had accompanying head or 
hand tremor (Aronson, Brown, Litin, et al., 1968b). 


+ Avideofluoroscopic study of three patients showed inter- 
mittent voice stoppages associated with complete glottic 
and supraglottic closure and tremor of the larynx, 
hypopharynx, and tongue during quiet respiration and 
contextual speech and were consistent with the patients’ 
tremulous voice quality. Tremor amplitude increased 
markedly during intermittent voice stoppages (McCall, 
Skolnick, and Brewer, 1971). 


+ In a study of 21 patients with voice tremor, 10 were 
described as having spasmodic dysphonia. In three there 
was a family history of tremor. Vowel prolongations 
showed irregular intervals of complete voice arrest at 
around 6 Hz (Dordain and Dordain, 1972). 


+ In 12 patients, tremulous, quavering phonation was found 
in at least three and postural tremor in three. In one case, 
tremor elsewhere in the body appeared 5 years after 
onset; in another, 1 year after onset. In one patient, 
tremor had appeared 20 years before onset of the voice 
disorder (Aminoff, Dedo, and Izdebski, 1978). 


+ A study compared three groups of patients, one consist- 
ing of 14 essential tremor patients without voice arrest, 
a second of 16 essential tremor patients with occasional 
voice arrests, and a third of 22 patients suspected as 
having the tremor type of adductor spasmodic dyspho- 
nia. On vowel prolongation, median voice tremor fre- 
quencies of 5.7, 5.0, and 5.5 Hz. were found in the three 
groups, respectively. The differences were statistically 
insignificant. Tremors in other parts of the body were 
found in 93% of the first group, 81% of the second group, 
and 50% of the third group (Aronson and Hartman, 
1981). 
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Though difficult, it is important to try to differentiate the 
regular tremor seen in benign essential voice tremor from 
the irregular dystonic tremors. Based on electromyographic 
recordings, Blitzer (1985) noted that 25% of his laryngeal 
focal dystonic, or spastic dysphonic, patients had irregular 
tremors. Unfortunately, when patients present with symp- 
toms of essential tremor in other parts of their bodies, the 
clinical distinction cannot always be made (Aronson, Brin, 
and Ludlow cited in Brin, et al., 1992). 


Emotional Stress as a Precipitant of Adductor Spasmodic Dys- 
phonia of Organic (Essential) Voice Tremor One would hope 
for the sake of simplicity that all psychogenic adductor spas- 
modic dysphonias showed a history of acute or chronic 
emotional stress and that absence of such a history would 
characterize the neurologic types. Regrettably, and remark- 
ably, such is not the case, for convincing evidence from sev- 
eral different sources shows that both essential voice tremor 
and the adductor spasms of essential tremor frequently fol- 
low emotional stress. That stress can produce permanent 
neurologic change is based on indirect data correlating life 
history with onset of the neurologic disorder. However, the 
time and circumstantial relationship is so close in single- 
case and group studies as to be highly convincing. As noted 
elsewhere, the concept that stress can precipitate incipient 
neurologic disease or be a primary cause of it is accepted 
doctrine in medicine and is not considered to be particularly 
unusual. Examples of case studies reported in the literature 
that seem to support this notion follow: 


+ A 72-year-old woman with a 20-year history of face and 
hand tremor developed voice tremor shortly after the 
death of her husband (Ardran, Kinsbourne, and Rush- 
worth, 1966). 


In three patients who had the tremor type of spasmodic 
dysphonia, the voice and associated tremors occurred 
after emotional shock several days after the traumatic 
event (Dordain and Dordain, 1972). 


+ Emotional stress was associated with onset in two groups 
of essential voice tremor patients, 29% and 25%, and in 
41% of adductor spasmodic dysphonias of essential 
tremor (Aronson and Hartman, 1981). 


The kinds of stress that precipitate adductor spasmodic 
dysphonia of organic voice tremor are not qualitatively dif- 
ferent from those responsible for the psychogenic type. 
Death of a relative and violation of mores within the family 
are common themes responsible for both the psychogenic 
and neurologic types. However, many more neurologic than 
psychogenic patients give histories free of such stresses and 
appear to have well-adjusted personalities. 


Adductor Spasmodic Dysphonia of Dystonia Evidence 
strongly supports the concept that adductor spasmodic 
dysphonia can also be a laryngeal sign of dystonia. As 
noted in Chapter 9, the larynx can be involved in the 
hyperkinetic dysarthrias of which dystonia is one type. 
Dystonia is a general term referring to a group of isolated 
neurologic signs or complete syndromes. Potentially con- 


fusing are several synonyms used to describe these signs: 
Meige’s syndrome, Brueghel’s syndrome, orofacial dyski- 
nesia, and focal dystonia. Spasmodic torticollis, athetosis, 
and writer’s cramp also fall within the purview of the dys- 
tonias. Spasmodic torticollis as a specific type of dystonia 
can coexist with dyskinetic facial and lingual movements 
and even tremor. 

Dystonia refers to patients who have relatively slow, 
adventitious, uncontrolled, nonrhythmic contractions of the 
lips, mandibular muscles, tongue, velopharyngeal muscles, 
larynx, thorax, or extremities. Dystonias are caused by 
lesions of the extrapyramidal system. The contraction of dif- 
ferent muscle groups can occur in any combination. Often 
they are confined to one region of the body, such as to the 
head and neck musculature. They can even involve a single 
muscle group, especially in the early stages of the disorder if 
it is progressive; in many patients it is static. Laryngeal 
dystonias are characterized as abnormal involuntary move- 
ments that are typically action-induced by the act of speak- 
ing. Because the resultant dystonic movements are unusual 
and rare, dystonias are one of the most frequently misdiag- 
nosed neurologic conditions (Nutt, Muenter, Aronson, et al, 
1988) including those emanating from the larynx. 

The evidence will show that adductor spasmodic dyspho- 
nia can be an example of a dystonia isolated to the larynx 
(i.e., a focal laryngeal dystonia) or it can be one component of 
a more generalized dystonia. As mentioned in the previous 
section on spasmodic dysphonia of organic voice tremor, 
when dystonia is focal to the larynx, the tendency is to 
describe the effect as spasmodic dysphonia. When addi- 
tional speech muscles are affected, the tendency is to call the 
entire speech disorder hyperkinetic dysarthria. 


Meige’s Syndrome Meige’s syndrome, originally called 
medial facial spasm, is a type of dystonia that consists of 
symmetric dystonic spasms of the facial muscles, which may 
first involve the eyes, producing blepharospasm, and later 
the remainder of the facial muscles, producing lip retraction 
and uncontrolled jaw opening and closing. Day-to-day fluc- 
tuations are typical, aggravated by stress, improved by seda- 
tion, and disappearing during sleep. A similar syndrome is 
called Brueghel’s syndrome or blepharospasm-oromandibular 
dystonia syndrome (Tolosa and Klawans, 1979). 

Jacome and Yanez (1980) described a 49-year-old man 
who had typical spasmodic dysphonia for 3 years that began 
with blepharospasm, symmetric facial spasms, and mild jaw 
retraction. Symptoms improved with rest and were worse 
under stress. He could talk and sing normally at times. “We 
believe this patient has a combination of Meige’s disease and 
spastic dysphonia,? with blepharospasm and medial facial 
spasms as described by Meige, as well as typical spastic dys- 
phonia.” Marsden and Sheehy (1982) found that the most 
common speech difficulty of patients with the dystonic 
spasms of Meige’s syndrome was “having to force air 
through a tight throat and mouth resulting in a strained, 


?They might have rephrased their statement to say Meige’s disease, of 
which spasmodic dysphonia was a sign or component, for that is probably 
what they had intended. 
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monotonous, slurring speech, which resembled closely the 
classical description of spastic dysphonia. . . . Since spastic 
dysphonia may occur in the same syndrome, it is quite likely 
that isolated spastic dysphonia itself may be a sole focal mani- 
festation of dystonia [emphasis mine]. Indeed, we have 
encountered a small number of patients with speech distur- 
bance indistinguishable from that seen in Meige Syndrome, 
patients who would be classified as cases of spasmodic dys- 
phonia but which we believe to be examples of isolated focal 
laryngeal dystonia.” 

Jankovic and Ford (1983) studied 100 patients with signs 
of dystonia. Twenty-one were found to have spastic dys- 
phonia. In 15, the dysphonia was accompanied by ble- 
pharospasm, orofacial movements, and tremor. In six 
patients, spastic dysphonia was the only sign. In view of the 
previous discussion of emotional stress as a precipitating, 
not a causal, factor in essential tremor and the spasmodic 
dysphonia of essential tremor, it is important to note that 
the dystonic patients in this study yielded similar psychi- 
atric signs: “Although we found no evidence to support 
hysterical origin of the facial movements, many patients 
had psychiatric problems which were preceded, followed 
by, or associated with the movement disorder. These 
include marked depressions (one-quarter of the patients), 
anxiety, obsessive-compulsive personality, schizoid per- 
sonality, space phobia, and a variety of other psychological 
abnormalities.” 

In a study of 10 patients with Meige’s syndrome during 
neurologic examination, Golper, Nutt, Rau, et al., (1983) doc- 
umented laryngospasm in five of the patients studied. One 
was described as having an alternating strained/strangled 
voice quality and voice stoppages; a second voice stoppage 
on vowel prolongations; a third voice stoppage and phona- 
tion on inhalation; a fourth strained/ strangled voice; and a 
fifth, a question of voice tremor. 

In 1875, Schnitzler (Kiml, 1963) described two patients 
with spastic dysphonia who had dystonic movements of the 
arms and legs. 

In three of 12 spastic dysphonic patients studied by 
Aminoff, Dedo, and Izdebski (1978), one had torsion dysto- 
nia, a second had buccolingual dyskinesia, and a third had 
blepharospasm. The authors stated: 


Our own findings would certainly suggest that spasmodic 
dysphonia should be regarded as a focal dystonia of the 
laryngeal musculature. .. . The high incidence of tremor 
among our patients with spasmodic dysphonia accords 
with the high incidence reported in patients with torsion 
dystonia... or related movement disorders such as spas- 
modic torticollis. 


Spasmodic Torticollis Spasmodic torticollis, also once thought 
to be a psychiatric disorder, is now regarded by most neurol- 
ogists as a dystonia caused by a lesion within the extrapyra- 
midal system. People who acquire this disease experience an 
uncontrollable pulling, twisting, or tilting of the head to one 
side. Variations may be a retrocollis in which there is an 
extension of the head backward, or a procollis, in which 
there is flexion of the head forward. The disease often shows 
signs of dystonic facial movements, blepharospasm, and 


dysarthria, including voice tremor. But, in addition to these 
classic signs, adductor laryngospasms may precede, follow, 
or parallel the development of spasmodic torticollis. Since 
Critchley’s (1939) early report of a patient with spastic dys- 
phonia who had spasmodic torticollis, the association 
between the dysphonia and this disease has been observed 
by many clinicians, particularly when the illness is in its full- 
blown stage. The association between torticollis, dystonia of 
other types, and tremor indicates the more than casual 
interrelationship among these various extrapyramidal syn- 
dromes. Schaefer (1983) also recognized the close connec- 
tion between adductor spasmodic dysphonia and spasmodic 
torticollis because of the frequent occurrence of both dis- 
eases in the same patient, the association of tremor in both 
diseases, and the similarities in the historical controversy as 
to whether spasmodic torticollis was psychogenic or neuro- 
logic. Once considered to be psychogenic, few would argue 
against the fact that spasmodic torticollis is now well 
accepted as neurologic. 

Looking back, what has been emerging is a picture of one 
form of adductor spasmodic dysphonia as a solitary or coex- 
isting sign of any one of several extrapyramidal movement 
disorders. These, it should be emphasized, are syndromes of 
the central nervous system. During a brief period in the 
search for neurologic etiology, the notion was advanced that 
the voice disorder could arise from a peripheral nervous 
system lesion, specifically of the tenth cranial nerve, and 
was of viral origin (Bocchino and Tucker, 1978; Dedo, Izdeb- 
ski, and Townsend, 1977). These clinicians based their 
hypotheses on the finding of what they thought were 
demyelinated nerve fibers in specimens taken from adduc- 
tor spasmodic dysphonic patients during nerve resection. 
However, Ravits, Aronson, Desanto, et al. (1979) were unable 
to find any evidence of differences between the nerve fibers 
in spasmodic dysphonic and normal subjects. On other 
grounds, it is unlikely that such a peripheral nerve lesion 
could produce massive spasticity; flaccidity is usually the 
result of such lesions. The hypothesis also could not account 
for periods of normal voice in many spasmodic dysphonic 
patients. 


Idiopathic Adductor Spasmodic Dysphonia 


The term idiopathic adductor spasmodic dysphonia is reserved 
for patients in whom no convincing evidence of psychogenic 
or neurologic etiology can be found. Some clinicians have a 
tendency to infer etiology without adequate proof, interpret- 
ing the patient’s dysphonia according to their own biases. To 
do so is not in the best interest of the patient or conducive to 
scientific thought. Loose diagnostic formulations given to 
the patient are usually taken to heart and can adversely 
influence the patient's self-concept. 

The category of idiopathic adductor spasmodic dysphonia 
is not diagnostic but a means of placing the diagnosis in sus- 
pense until further data have been acquired to show a defi- 
nite etiology. If the patient is suitably followed over a long 
period of time, a psychogenic or neurologic etiology often 
will reveal itself, permitting a change of classification. One 
example is that of a 34-year-old attorney who was referred 
by an otolaryngologist who cited adductor spasmodic 
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dysphonia as the only complaint. The psychiatric history and 
neurologic examinations failed to disclose any evidence of 
cause from either discipline. Three years later, on the 
patient’s return to the clinic, dystonic leg movements and 
procollis were found on neurologic examination, in addition 
to a worsening of his dysphonia. 


Summary of Adductor Spasmodic Dysphonia 


+ Adductor spasmodic dysphonia is characterized by a 
strained, groaning, effortful, staccato voice varying in 
qualities of these components among different patients 
and also in severity. 


+ The strained, effortful voice is the product of hyperadduc- 
tion of the true, and often false, vocal folds and accompa- 
nied by excessive extrinsic laryngeal muscle action that 
usually elevates the body of the larynx in synchrony with 
the spasms. 


+ The psychogenic causes of adductor spasmodic dyspho- 
nia are conversion disorder or musculoskeletal tension. 


+ The neurologic causes are organic (essential) tremor, oro- 
facial dyskinesia, focal dystonia or dystonia (Meige’s syn- 
drome), and spasmodic torticollis. These extrapyramidal 
diseases can occur singly or in combination. Their spas- 
modic dysphonias can occur in isolation in the beginning, 
and as the disease progresses, other signs will appear. 
Often, the dysphonia may be the only sign or the predom- 
inant one. 


+ It is advocated, therefore, that the type of adductor spas- 
modic dysphonia be designated in any ultimate diagnos- 
tic categorization (e.g., adductor spasmodic dysphonia of 
essential tremor, adductor spasmodic dysphonia of dysto- 
nia). If no specific cause can be found, the term idiopathic 
adductor spasmodic dysphonia should be employed. 


In most cases, adductor spasmodic dysphonia, regardless 
of etiology, begins gradually in the middle years, some- 
where between 40 and 50 years, although studies have 
shown onset in childhood. The ratio of males to females 
ranges from 1:1 to 1:4, depending upon studies con- 
sulted. 


For both psychogenic and neurologic types, a high per- 
centage of patients experience onset as associated with 
emotional stress, causal in the former type and as a trig- 
gering of the neurologic disease in the latter. 


Regardless of cause, the disorder usually produces serious 
occupational, social, and emotional disturbances. 


Treatment of Adductor Spasmodic Dysphonia 


Any consideration of therapy for adductor spasmodic dys- 
phonia must account for the following realities: (1) Not one, 
but several, different types of the disorder exist. Therefore, 
no single therapy can be recommended for all types. More- 
over, for any single type, there may be multiple treatments 
or combination of treatments recommended. (2) It is notori- 
ous for its fluctuations in severity in response to mood and 


environment. Therefore, valid, reliable assessment of the 
effectiveness of therapy cannot be made on the basis of 
office sampling alone but needs to be based on a cross sec- 
tion of communicative experiences. (3) Clinicians differ 
among themselves in their criteria for improvement, 
because the results of therapy are more often partial than 
complete. (4) Clinicians differ from patients in their criteria 
for improvement owing not only to differences in their 
expectations but also because improvement in the amount 
of physical effort required to phonate can only be evaluated 
by the patient and cannot be judged by an observer. (5) 
Studies of the effectiveness of therapy are difficult to com- 
pare owing to different methodologies, for example, deci- 
sion as to what constitutes adductor spasmodic dysphonia, 
failure to consider etiology with respect to homogeneity or 
heterogeneity of the patient population, method of meas- 
urement of the effects of therapy, and type and duration of 
follow-up. 


Therapy and Diagnostic Examinations 


Establishment of the different etiologic types of adductor 
spasmodic dysphonia has been based on after-the-fact 
research findings. However, in actual clinical practice, 
although etiology can be apparent in the evaluation, a differ- 
ential diagnosis among psychogenic, neurologic, or idio- 
pathic types may be difficult and not apparent until the 
patient is well into therapy. For example, dysphonias that 
disappear for extended periods of time in response to symp- 
tomatic voice therapy or psychotherapy are not likely to be 
of neurologic etiology. For this reason, in the interest of con- 
servative patient care, it is wiser to begin with a trial of 
symptomatic voice therapy and to reserve more risky types 
of therapy for intractable patients. 

What seems to be emerging through experience with 
voice disorders, but in particular with adductor spasmodic 
dysphonia, is that therapy is more often successful when it is 
based on a solid foundation of diagnostic examinations from 
an interdisciplinary team consisting of a speech-language 
pathologist, neurologist, otolaryngologist, and psychologist 
or psychiatrist. The information obtained is obviously of pri- 
marily differential diagnostic value, but not entirely. If the 
clinician assumes that all patients who have adductor laryn- 
gospasms are alike, the probability is high that the wrong 
therapy will be selected, and, because patient education is 
indispensable, the clinician will not be in a very comfortable 
position to convey sufficient or correct information. The pre- 
maturely informed, and therefore, the often misinformed 
patient is unusually common in cases of adductor spasmodic 
dysphonia. If at all possible, the following examinations 
should be collaborative. 


Laryngologic Examination At the earliest opportunity, the 
patient should be scheduled for a thorough laryngologic 
examination. Although the signs of adductor spasmodic dys- 
phonia are easily identifiable through listening, the ear, nose, 
and throat examination is important for several reasons: (1) 
The patient expects, and needs to be reassured by, a physical 
inspection of the interior of his or her larynx. (2) The clini- 
cian cannot confidently divest patients of their commonly 
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held beliefs that tumors or other structural lesions are 
causing their dysphonias unless the clinician is convinced 
on the basis of firsthand information that such lesions are 
absent. (3) A minority of adductor spasmodic dysphonic 
patients have vocal nodules, polyps, or contact ulcers trau- 
matically induced by their adductor laryngospasms, which 
contribute to the severity of their dysphonias. (4) Rhythmic 
adduction of the vocal folds producing voice tremor and 
rhythmic hyperadduction producing voice arrest can be 
viewed on laryngoscopic examination and is important 
diagnostically. 


Speech Examination A thorough speech examination is a 
necessity and should include a motor speech test for 
dysarthria. The reason for this prescription will be apparent 
if it will be recalled that spastic dysphonia may be of the 
neurologic type, and signs, often quite subtle, of tremor or 
movement disorder of the oral musculature may reveal 
information that can be of use to the neurologist and in plan- 
ning of treatment. 

Of all the speech tests that are usually administered dur- 
ing such an examination, none is more critical than vowel 
prolongation structured sentence production. The reason 
for sustained vowel production is that irregular laryn- 
gospasms during contextual speech may become regular or 
rhythmic during vowel prolongation, revealing that the 
patient’s spasmodic dysphonia is of the essential (organic) 
tremor type. Structured sentences contrasting words begin- 
ning with vowels with those beginning with fricatives help 
differentiate spasmodic dysphonia of muscle tension dys- 
phonia from other types (Idzebski, Dedo, et al., 1985; Roy, 
Gouse, Mauszycki, 2005). Also of particular importance is to 
obtain baseline audiotape and, if possible, videotape record- 
ings for a more detailed study of the patient and for com- 
parison with subsequent recordings, especially during and 
after therapy. Recordings may be used to quantify voice 
breaks and tremor type, regular or irregular. 

Musculoskeletal tension testing and brief efforts to reduce 
such tension through laryngeal manipulation, as described 
elsewhere in this book, is another indispensable portion of 
the examination. Patients who have such tension associated 
with psychogenic forms of the disorder will often respond 
with enough voice improvement for a sufficient length of 
time during the examination to convince the clinician, and 
the patient, of a nonorganic form of the disorder, thereby 
approximating the diagnosis and indicating the direction 
that therapy should most likely take. 


Psychosocial Examination No patient with the voice signs 
of adductor spasmodic dysphonia should be denied the ben- 
efit of an exploration of possible psychologic causes, precipi- 
tants, or effects of the voice disorder. As noted elsewhere in 
this book, a preliminary psychosocial history surrounding 
the onset and development of the dysphonia deserves care- 
ful investigation, preferably by the examining speech pathol- 
ogist or by a psychiatrist, psychologist, or psychiatric social 
worker of known qualifications. This examiner should col- 
laborate with the laryngologist and speech pathologist. Here, 
again, the patient will often reveal the etiologic diagnosis by 
providing psychodynamic information to help explain the 


voice disorder and, during the course of such revelations, 
demonstrate diminution or disappearance of the spastic dys- 
phonia. 


Neurologic Examination Referral of the patient for a neu- 
rologic examination is not routinely warranted, but if there is 
any doubt about the patient’s neurologic status, such referral 
can be highly clarifying, for example, when tremor or move- 
ment disorders are suspected of being related to the spastic 
dysphonia. 

Armed with data from these different disciplines, the 
speech clinician in collaboration with the other specialists 
can chart a plan of action to help the patient with this most 
incapacitating of all voice disorders. A review of the findings 
with the patient is always a wise decision. Information itself 
is therapeutic beyond question even when nothing can be 
done for the voice for the time being. Based on all of these 
data, the voice team must decide whether symptomatic 
voice therapy alone, psychotherapy alone administered by a 
qualified therapist, surgical treatment, or some combination 
in parallel is warranted, with the idea held in reserve that 
should these fail, more drastic measures may have to be con- 
sidered. The complexity of these decisions has been summa- 
rized by Pearson and Sapienza (2003) in their review and 
tutorial of the historical approaches to treating spasmodic 
dysphonia (see bibliography for specific reference). 


Symptomatic Voice Therapy 


Symptomatic voice therapy as the first stage in attempts to 
alleviate the disorder should be started with all patients 
except those in whom a neurologic type that will not 
respond to behavioral management has been definitely 
established. However, even when behavioral management is 
not considered to be effective as a primary approach, Murry 
and Woodson (1995) have demonstrated it may enhance 
and/or prolong effects of phonosurgical interventions. The 
voices of patients who have the tremor and dystonic types 
do not respond well to attempts to alter their control over 
their spasms. Although there is no harm in trying such ther- 
apy with patients whose dysphonia is neurologically related, 
especially if the clinician is not sure about the etiology, 
expectations of improvement should be guarded. These 
patients often give the impression of being able to alter their 
voices during therapy, but their ability to sustain less spastic 
voice in their daily lives usually breaks down. 


Musculoskeletal Tension Reduction 


In patients who have nonorganic forms of adductor spas- 
modic dysphonia, muscle tension reduction, as described 
elsewhere in this book, will often yield normal, or nearly 
normal, voice within a brief period of time. If the muscle ten- 
sion is associated with an active conversion reaction, such 
physical manipulation will be only partially successful or 
totally unsuccessful, pending resolution of the underlying 
emotional conflict. However, even in cases in which conflict 
is active, considerable voice improvement can be expected. 
However, such patients will also require psychologic coun- 
seling. 
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Voice Quality and Pitch Modification 


Paradoxically, normal or nearly normal voice under certain 
conditions of phonation becomes apparent to most clini- 
cians as they investigate patients who have adductor spas- 
modic dysphonia, except the essential tremor type. A patient 
may have a severely strained voice during contextual speech 
most of the day, but it may be normal on arising, when 
shouting, laughing, crying, or singing. It may also be free of 
spasm during vowel prolongation even though not during 
conversation. It may be nearly normal at higher pitch levels 
or when phonating with a breathy voice quality. 

These conditions of improved voice often give promise of 
relief, suggesting to the clinician that if the patient can 
phonate more easily at a higher pitch or using a breathy 
voice during vowel prolongation, or during reading of prac- 
tice material in the clinic, the patient consciously ought to 
be able to sustain spasm-free voice during daily conversa- 
tional speech. And, in many patients, such improvement 
has been effected. Cooper’s (1980) therapy program for 
adductor spasmodic dysphonia is an example of the school 
of thought that believes voluntary alteration of phonation, 
and respiration, can benefit these patients. He advocates as 
individual and group therapy (1) reestablishment of natural 
or optimum pitch and “tone focus in the mask area” using 
the spontaneous “um-hum” method, (2) correct breath 
support using gentle, abdominal breathing, and (3) “peri- 
patetic” voice therapy during which the clinician works 
with the patient at the actual site and under actual speak- 
ing conditions. 

One of the few data-driven reports on success of behav- 
ioral management with this population was reported by 
Murry and Woodson (1995). Unlike Cooper, they did not 
work directly on tone focus but rather facilitated reductions 
in voice break and vocal effort by using “easy onset” voicing, 
continuous airflow during phonation, and reduction in artic- 
ulatory effort. This direct voice therapy was combined with 
Botox (Botulinum toxin, Allergan, Inc., San Francisco) injec- 
tions and resulted in the positive effects from Botox lasting 
longer thereby decreasing the number of injections required. 
Dedo and Idzebski (1983a,b) also emphasize the importance 
of voice therapy in combination with surgery for maximum 
rehabilitation results. Boutsen (1992) and Pearson and 
Sapienza (2003) suggest that this may be a form of sensori- 
motor retraining. “Theory holds that muscular weakening 
(such as that occurring after Botox injection) effectively sup- 
presses maladaptive sensorimotor loops, which feed dys- 
tonic symptoms. It is during this period of weakening that 
the patient is most able to acquire (and ideally habituate) 
new sensorimotor templates. In the case of ADSD, this 
retraining holds as its focus the production of ‘easy’ and con- 
tinuous voicing while minimizing extraneous muscular ten- 
sion” Pearson and Sapienza, 2003, p. 334). 

Harrison, Davis, Troughear, et al. (1992) have suggested an 
alternative approach using inspiratory voicing. In theory, this 
treatment requires abduction of the vocal folds coincidently 
with activation of the inspiratory muscles and provides a 
counter force to the abnormal adductory muscle action and 
produces more fluid speech. In practice, it is difficult to 
habituate, rarely sounds normal, and works for a limited 
number of patients. 


Psychotherapy 


As a means of alleviating adductor spasmodic dysphonia, 
historically psychotherapy has been highly suspect. This atti- 
tude may have developed because of a failure to separate the 
neurologic from the psychogenic types. Adductor spasmodic 
dysphonia due to conversion reaction or to depression or 
anxiety often yields to the emotional catharsis and discus- 
sion and resolution of emotional conflict. Often, the spastic 
dysphonia, whatever the cause, produces such withdrawal 
and depression that psychologic help is warranted on that 
basis alone. 


Recurrent Laryngeal Nerve Resection 


The first person to perform surgery to alleviate sphincter 
laryngospasm responsible for spastic-like disturbances of 
voice was Réthi (1952), who blamed the disorder on the sty- 
lopharyngeus muscle and divided it in one such patient. 
Twenty years later, Hammenberg (1972) advocated injecting 
either the posterior wall of the pharynx or the stylopharyn- 
geus muscle to anesthetize these sphincters, describing the 
effect of the injection as highly favorable, the voice recover- 
ing in quality and volume and even persisting after the Novo- 
cain (procaine hydrochloride) had worn off. Neither surgery 
to the stylopharyngeus muscle nor injection of the pharynx 
ever became popular as a therapy for adductor spasmodic 
dysphonia. What did was a surgical approach in which one of 
the recurrent laryngeal nerves in the neck is severed, paralyz- 
ing one vocal fold and thereby reducing the degree of adduc- 
tor force, allowing a freer flow of air through the glottis 
during phonation. Called the recurrent laryngeal nerve sec- 
tion, the procedure was first introduced in 1976 by Dedo. 
After first temporarily paralyzing one vocal fold by injecting 
one recurrent laryngeal nerve with lidocaine and listening for 
improvement in the voice to determine if the patient is a 
good candidate, a recurrent laryngeal nerve section is done. 
Under general anesthesia, a collar incision is made one 
finger-breadth below the cricoid cartilage. Dissection is car- 
ried down to the anterior surface of the trachea and around 
its right side to the tracheoesophageal groove. The recurrent 
nerve is identified ordinarily at the level of the inferior pole 
of the right thyroid gland 1 cm lateral to the trachea. Direct 
laryngoscopy is then performed so that the right vocal fold 
can be observed while an assistant crushes the presumed 
recurrent nerve with a hemostat. If the vocal cord contracts 
abruptly at the instant of crushing, the structure is presumed 
to be the whole recurrent nerve and is ligated . A 1-cm seg- 
ment is then removed adjacent to the inferior pole of the 
thyroid gland ~3 to 4 cm below the cricoid (Dedo, 1976). 
Since Dedo’s initial description, recurrent laryngeal nerve 
surgical variations have included laryngeal nerve crush 
(Biller, Som, and Lawson, 1983), recurrent laryngeal nerve 
avulsion (Weed, Jewett, Rainey, et al., 1996), and selective 
laryngeal adductor denervation-reinnervation (Berke, 
Blackwell, Gerratt, et al., 1999; Smith et al, 2006). The his- 
tory of the results of recurrent laryngeal nerve surgery and 
estimates of its value reported by different investigators 
require careful reading and thought because of differences 
in surgical procedures, philosophies and methods, and 
length of postsurgical follow-up. All the references on the 
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surgery cited at the end of this chapter in the Additional 
Reading section ought to be carefully scrutinized. When the 
findings from all of these studies are weighed, they demon- 
strate: 


+ The immediate postsurgical effects on the voice are over- 
whelmingly favorable, and for the following reasons. The 
voice is freed of the strained, hoarse quality and the often 
fatiguing physical effort to produce it. Facial grimacing 
and other body movements associated with the spasms 
diminish or disappear. Patients experience a reversal of 
their depression and withdrawal. 


+ The voice quality during the early postsurgical period 
tends to be breathy, weak, and hoarse. With time, the 
voice becomes clearer and louder. However, only a minor- 
ity of patients can be said to achieve a completely normal 
voice. Often some degree of breathiness or hoarseness 
remains in patients regarded as successful. If either of 
these symptoms is intolerable for the patient, secondary 
surgical procedures have been developed. 


+ With the passage of time, many patients experience par- 
tial or complete return of their adductor spasmodic 
dysphonia to presurgical levels of severity or worse. The 
percentage of patients who fail depends upon which 
studies are consulted. They range from 15 to 64% failure 
3 years after surgery. 


+ Explanations for return of the spastic dysphonia are neu- 
ral regrowth or “sprouting” and/or overcompensation. 
Overcompensation may involve hyperfunction of the con- 
tralateral true and false, or supraglottic pharyngeal con- 
strictors or sphincteric action of the false vocal folds and 
inferior pharyngeal constrictor muscles. Some patients 
benefit from secondary procedures done to debulk the 
vocal folds. 


+ Females experience a higher failure rate than males for 
unknown reasons. 


+ There is no way of predicting who will succeed in the 
longer term and who will fail. 


Patients have a greater tendency than clinicians to rate the 
postsurgical voices as better (Sapir et al, 1986). 


+ To the patient, the reduction of effort and fatigue of 
phonation is as important as improvement in the sound 
of the voice. 


+ Many patients who have failed within 3 years of the sur- 
gery say they were grateful for whatever relief they had 
from the spastic dysphonia. 


 Postsurgical voice therapy may help patients destined to 
remain improved to maintain a smoother, clearer voice 
but will probably not prevent recurrence of the spastic 
dysphonia in patients whose voices are destined to dete- 
riorate. 


Side effects of the surgery are minimal. Some patients 
experience a temporary need to swallow fluids carefully 
to avoid minor dysphagia, but this is not a serious or long- 
term complaint. 


+ Patients who have experienced a return of the spastic 
dysphonia are sometimes helped by further surgery, con- 
sisting of vocal fold thinning with carbon dioxide laser. 


Patients whose voices have remained excessively breathy 
for a long period after surgery owing to an excessively 
wide glottis may benefit from vocal fold medialization by 
injection of some material into the paralyzed vocal fold or 
thyroplasty. 


An alternate surgical procedure in which the laryngeal 
nerve is crushed instead of severed is not recommended 
owing to the unusually high failure rate, greater than 85%, 
within 10 months, and is probably due to reinnervation of 
the nerve. 


What can clinicians conclude about the wisdom and advis- 
ability of performing recurrent laryngeal nerve removal or 
sectioning on adductor spasmodic dysphonia patients? How 
should they counsel their patients when the subject of sur- 
gery is raised? A conservative answer is to advise them of all 
treatment options and of the results of studies that show a 
degree of risk of return of the voice, but that there should be 
no fear that they would be worse off from the surgery than if 
they had not elected it. The positive aspects of the surgery 
need to be presented as well; that successful patients can 
experience excellent voice and are grateful for the benefits of 
the surgery. The lidocaine block as a preliminary procedure to 
allow the patient, family, and clinician to evaluate the simu- 
lated effects of surgery is a highly useful way of eliminating 
patients who are poorly motivated or who have excessively 
high expectations from the surgery. Allowing prospective 
patients to evaluate the audiotape and videotape recordings 
of patients who have had the surgery is another useful way of 
educating the patient as to the effects of surgery. The most 
balanced view of the surgery would be that, whereas it is not 
the solution for all patients, it can afford value for others. It 
should be carefully offered as a temporary benefit for highly 
motivated patients who have severe adductor spasmodic 
dysphonia and who have seriously suffered psychologically 
and vocationally from their disorder. 


Botulinum Toxin Injection 


Injections of botulinum toxin (Botox) has emerged as the 
preeminent and preferred approach due to its high rate of 
success and low risk of side effects (Blitzer, Brin, Stewart, 
1998). Botox is injected either unilaterally or bilaterally 
directly into the thyroarytenoid or lateral cricoarytenoid or a 
combination of both muscles, which produces partial paral- 
ysis, preventing spastic hyperadduction of the vocal folds. 
Botulinum toxin is a potent neurotoxin produced by the 
bacteria Clostridium botulinum found in one form of food 
poisoning. Of eight known strains, the type A botulinum 
toxin is the most potent and may be crystallized in stable 
form. When injected in low doses, it binds firmly to muscle 
and is rapidly fixed at the terminal nerve fibers with little 
remaining toxin allowed to pass into the general circulation, 
accounting for its relative safety (Cohen and Thompson, 
1987). In the rare instances when after repeated injections 
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patients have built up sufficient antibodies that they no 
longer respond to type A botulinum toxin, an alternative 
form may be used. 

Physiologically, botulinum toxin blocks the release of 
acetylcholine from the nerve ending, interfering with cal- 
cium metabolism and, in effect, denervating muscle fibers 
for months; because the motor end plate does not release 
acetylcholine, the muscle fibers cannot contract. When den- 
ervated, the muscle atrophies but then develops new nerve 
sprouts, reinnervating the muscle, reversing the weakness 
over a period of approximately 3 months. 

The toxin is injected either percutaneously, transorally, or 
transnasally. Percutaneous injections are made through the 
cricothyroid membrane into the thyroarytenoid muscle by 
means of a syringe with a monopolar, Teflon-coated, hollow 
electromyographic recording needle. The hypodermic nee- 
dle functions as an electromyographic electrode to locate the 
thyroarytenoid muscle before the toxin is injected into it. 
The alternative injection techniques were developed in an 
effort to increase the accuracy with which toxin could be 
administered and eliminate the need for electromyography 
(EMG) monitoring. Surgeon preference and training, individ- 
ual patient needs, equipment availability, and convenience 
generally determine the type of injection. 

The toxin has been injected in patients’ vocal folds both 
unilaterally and bilaterally. A meta-analysis of existing liter- 
ature suggests that neither bilateral nor unilateral vocal fold 
injection is consistently associated with better outcomes 
(Boutsen, Cannito, Taylor, et al., 2002). Nevertheless there 
are suggestions in the literature that the two techniques 
have different outcomes. Blitzer (1988) suggests bilateral is 
preferable to unilateral for the reasons that bilateral injec- 
tions require considerably less toxin and that a better voice 
can be achieved. Bielamowicz and associates (2002) provide 
some evidence supporting unilateral preference to minimize 
side effects of persistent breathiness and increase duration 
of symptom relief. In a comparison of unilateral and bilateral 
injections, both Ludlow, Bagley, Yin, et al. (1992) and Mal- 
oney and Morrison (1994) concluded that unilateral tech- 
niques in females result in better functional results. Patients 
who have had a return of their spastic dysphonia after recur- 
rent laryngeal nerve resection have also benefited from the 
procedure. 

The toxin takes effect within 24 to 72 hours, during which 
the voice becomes breathy. However, patients’ voices 
respond differently because of differences in the severity of 
their spastic dysphonia and because the amount of toxin 
needed depends on the individual patient. Laryngoscopic 
examination after the induced weakness of the vocal folds 
discloses that they are not completely paralyzed but appose 
each other with less than full closure, revealing a slight glot- 
tal opening during phonation. The initial breathiness disap- 
pears within days, and a smoother, nearly normal voice can 
last from 3 to 6 months, toward the end of which the strained 
voice gradually returns to its preinjection status signaling 
the need for reinjection. 

Injections are performed on an outpatient basis with a 
minimum of discomfort. The degree of weakness can be con- 
trolled by the administration of low doses of the toxin in the 
beginning and by titrating the dose until the proper amount 


for producing optimum voice with minimal side effects is 
found. 

The voice quality achieved from the botulinum toxin injec- 
tion has been found to be at least as good as, and in some 
cases better than, the voice quality obtained from recurrent 
laryngeal nerve resection (Miller, Woodson, and Jankovic, 
1987). In addition to the improved voice, as in recurrent 
laryngeal resection, patients also report reduced physical 
effort to phonate, corroborated quantitatively by a consider- 
able reduction in pre- and postinjection intrathoracic pres- 
sures. For example, in a study reported by Miller, Woodson, 
and Jankovic (1987), in one patient the preinjection intratho- 
racic pressure was 22 to 23 cm H20 and 5 cm H30 after injec- 
tion, and in a second patient, it was 75 cm H20 preinjection 
and 3.5 cm H20 postinjection. 

To review, the botulinum toxin injection for adductor 
spasmodic dysphonia is currently the therapy of choice for 
intractable adductor spasmodic dysphonia that is refractory 
to other therapies, though it is not necessarily the ultimate 
answer and the search for better treatments continues. The 
voice quality obtained is as good or better than the recurrent 
laryngeal nerve resection, the injection can be given as an 
outpatient procedure rather than under general anesthesia, 
and the toxin does not produce a permanent paralysis of a 
vocal fold. The latter advantage is especially attractive to 
patients and practitioners alike who are reluctant to para- 
lyze a vocal fold permanently if it can be avoided. Repeated 
injections produce similar or identical results. The main dis- 
advantage to the procedure is that the patient needs to be 
reinjected within 3 to 6 months. However, highly motivated 
patients have reported that this is a minor inconvenience 
considering the relief from the laryngospasms and improve- 
ment in voice and ease of phonation. Because the long-term 
effects of repeated injections have not yet been determined, 
it is not known for how many years patients will continue to 
derive benefit from repeated injections or whether the 
immune system will eventually negate the effects of the 
toxin over time (Blitzer, Brin, Fahn, et al., 1988; Brin et al, 
1987). Because the injections for spasmodic dysphonia are 
given in small amounts of toxin, particularly in contrast with 
other Botox treatments such as for torticollis, for which there 
are longer-term data, the risk of acquiring resistance to 
Botox is considered low. 

Laryngeal framework surgery done to reduce laryngeal 
tension has also been reported with variable success. This 
midline lateralization (Isshiki type II) thyroplasty decreases 
vocal fold closure by increasing the lateral dimensions of the 
glottis. In this procedure, the thyroid cartilage is split down 
the middle, and shims are placed between the two to widen 
the space. If the mechanical advantage desired by this sur- 
gery is not initially achieved, the shims can be down- or up- 
sized as needed in a secondary procedure (Isshiki, 1998). 

Case studies using alternative treatments such as hypnosis 
and acupuncture have been reported with varying degrees of 
success (Scheer and Lee, 2001). The fact that they are case 
studies does not mean that they should be negated but 
rather used to point to directions of future inquiry. 

In summary, therapy outcomes for ADSD are mixed. There 
is no single treatment that has consistently positive results 
without side effects or need for subsequent procedures. 
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Behavioral management coupled with various surgical pro- 
cedures appears to enhance results. 


¢ Abductor Spasmodic Dysphonia 


Definition 


As previously stated, spasmodic dysphonia is not a homog- 
enous condition. We now turn our attention from adductor 
spasmodic dysphonia to abductor spasmodic dysphonia. 
Abductor spasmodic dysphonia is a term used in reference to 
a perceptually distinctive voice in which normal or hoarse 
voice is interrupted by moments of breathy or whispered 
(unphonated) segments (Aronson, 1973). The term abduc- 
tor spasmodic dysphonia was chosen because it appeared as 
if the vocal fold physiology responsible for the voice disor- 
der was the opposite of that which occurs in adductor spas- 
modic dysphonia; that is, instead of spastic or spasmodic 
hyperadduction of the vocal folds producing moments of 
strained voice or voice arrests, the vocal folds spasmodi- 
cally hyperabduct, releasing bursts of unphonated air. 
Although the disorder has not been long recognized, evi- 
dence is beginning to show that, as with the adductor 
forms, abductor spasmodic dysphonia may not be due to a 
single cause but may have either psychogenic or neurologic 
substrates. 








Pathophysiology 


Fiberscopic viewing of the vocal folds during connected 
speech discloses that, with each breathy air release, a syn- 
chronous and untimely abduction of the true and false vocal 
folds occurs exposing an abnormally wide glottic chink. This 
wide glottic chink may encompass the length of the glottis or 
may be exhibited as a bowed configuration of the midmem- 
branous portion of the glottis. Frequently, bowed vocal folds 
also may be found on laryngoscopic examination. These 
abductor spasms, or loss of laryngeal tone, are triggered by 
consonant sounds, particularly the unvoiced ones, especially 
when they are in the initial positions of words. For example, 
in some patients it is possible to inhibit the breathy release 
of air entirely by having the patient read material containing 
only voiced consonants. Conversely, it is possible to precipi- 
tate rampant bursts of breathiness by asking the patient to 
read passages saturated with unvoiced consonants. The 
duration of breathy segments evoked by initial stop conso- 
nants has been measured at 2 to 3 times normal, but the total 
duration of words can remain normal nevertheless; the 
period of aspiration intrudes into the vowel segment, deplet- 
ing its energy (Zwitman, 1979). 

Figure 6.5 shows spectrographs of a normal speaker and 
an abductor spasmodic dysphonic speaker producing the 
sentence: “Berries are good on bread.” The abductor breathy 
segments can be detected by the diminished or absent voice 
fundamental frequency energy at the following points: (1) a 
prolonged interval between the /z/ ending the word berries 


i p 


Figure 6.5 Spectrographs of (A) a normal speaker and (B) a 
patient with abductor spasmodic dysphonia producing the sen- 


e tence: “Berries are good on bread.” (From Zwitman, D.H. 


[1979]. Bilateral cord dysfunctions: abductor type spastic dys- 
¢  phonia. J Speech Hear Disord 44, 373-378.) 
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and the /a/ vowel beginning the word are; (2) between initia- 
tion of the sound /g/ and the vowel /u/ in good with some loss 
of sound energy during the vowel; (3) a prolonged interval 
between the final /d/ of good and the initial /a/ of on; and (4) 
during production of the /b/ and /r/ phonemes of the word 
bread. Aspiration time occupied 96 of 260 milliseconds, or 
36.5% of the word, whereas no aspiration occurs during nor- 
mal production of the word. However, sentences spoken with 
abductor breathiness can take longer than normally phonated 
ones as noted in a study of Merson and Ginsberg (1979) in 
which one patient required 2150 milliseconds compared with 
a normal control who spoke the same sentence in 1775 mil- 
liseconds, and whose mean airflow rate was 77 cm?/s com- 
pared with the dysphonic person’s airflow of 400 cm?/s. 


Onset, Course, and Factors Affecting Severity 


Abductor spasmodic dysphonia usually begins insidiously, 
not essentially differently from onset in patients who 
develop adductor. The voice may begin as a nonspecific 
hoarseness or breathiness and over a period of days or weeks 
begin to show signs of intermittent breathy air release. 
Severity of the dysphonia fluctuates under conditions sim- 
ilar to the adductor type. When the patient is relaxed and 
free of tension, pressure, or worry, the voice can be nearly 
normal. It can be better or nearly normal during conditions 
of anger or laughter. It is much worse when the patient is 
emotionally upset, when required to speak with figures of 
authority, or when the communicative content of speech is 
more than usually important. Severity can be temporarily 
reduced by avoiding voiceless consonants, voicing unvoiced 
consonants, and by phonating at higher pitch levels. 


Factors Associated with Onset 


Factors associated with onset are not essentially different in 
these patients as in those who have adductor type. The dis- 
order can occur in the absence of any known associated 
event or it can follow laryngitis with sore throat or domestic 
or work-related emotional stress. 

A history of previous dysphonias and factors associated 
with onset in five patients who today would be designated as 
having abductor spasmodic dysphonia were reported by 
Aronson, Peterson, and Litin (1964), who found that three 
had had previous episodes of pure whispered (aphonic) 
speech, all five had had previous hoarseness, in two the 
onset had been associated with cold or influenza, in all five 
the onset had been associated with fatigue or exhaustion, 
and in two the onset had been associated with feelings of 
tightness in the throat. 


Etiology 


With the continual gathering of data on these patients, a pat- 
tern appears to be emerging in which these patients, like 
their adductor counterparts, may be divided into psy- 
chogenic, neurologic, and idiopathic types, and in which psy- 
chologic stress may precipitate the neurologic ones. In a 
study by Hartman and Aronson (1981), 10 of 13 (77%) patients 
had a scattering of neurologic signs: deterioration in writing, 


synkinetic facial movements, hyperreflexia, ataxic gait, nasal 
regurgitation, and dysphagia. However, more importantly, 
this study also uncovered a neurologic parallel to one form of 
adductor spasmodic dysphonia of essential tremor. In five 
(30%) of the 17 patients, voice tremor on vowel prolongation 
was documented within the range of 4.5 to 6.0 Hz, which is 
well within the limits of the essential tremor syndrome. On 
laryngoscopic examination, in some patients the abductor 
spasms are rhythmic, and a breathy release of air occurs on 
the abductor phase of each tremor cycle. In other patients, 
voice tremor without breathy air releases is audible on vowel 
prolongation (Fig. 6.6A). Four (31%) of the 13 patients in the 
study who had been given neurologic examinations had 
tremor of the head, face, and extremities. There may be, then, 
at least one definable neurologic type of abductor spasmodic 
dysphonia that parallels the adductor form: abductor spas- 
modic dysphonia of essential voice tremor. 

Some patients have a mixed abductor-adductor spas- 
modic dysphonia; moments of breathiness occur alternately 
with moments of strained-harshness or voice arrest (Fig. 
6.6B). Noting the co-occurrence of abductor and adductor 
laryngospasms, Cannito and Johnson (1981) advanced the 
reasonable proposition that, rather than adductor and 
abductor spasmodic dysphonia being separate disorders, 
they exist on a continuum, a view that is not only plausible 
but one that would help to explain their similarities pertain- 
ing to nature of onset, factors influencing severity, and mul- 
tiple etiologies. 

Convincing evidence also is accumulating to show that 
there is a psychogenic form of abductor spasmodic dyspho- 
nia as well as neurologic. In a psychiatric study, seven 
patients, who had what was then called intermittently whis- 
pered-phonated speech before the coining of the term abduc- 
tor spasmodic dysphonia, had various combinations of the 
following: histories of chronic stress, clinical signs of hyster- 
ical personality, histories of other conversion somatic signs, 
marital discord, repressed anger, immaturity and depend- 
ency, neurotic life adjustments, mild to moderate depres- 
sion, and the “conversion V” on the Minnesota Multiphasic 
Personality Inventory test (Aronson, Peterson, and Litin, 
1966). Voice signs of abductor spasmodic dysphonia of con- 
version disorder were found in the following patients: 


Case Study 6.4 


A 46-year-old housewife and beauty salon operator had 
been under chronic stress from several sources. Her 25- 
year-old son had a behavior problem. She harbored 
unspoken hostile feelings toward her husband, who had 
never adequately supported the family and who was 
unemployed. She was perpetually irritated with her 
employees for their inability to measure up to her stan- 
dards but was unable to express her feelings to them. She 
had chronic and overwhelming fatigue by early afternoon 
and would have to retire to her bed. Her marginal insight 
into her problem was evident from her statement that “I 
am afraid my employees think | am angry with them when 
| lose my voice and can’t talk” (Aronson, Peterson, and 
Litin, 1966, p. 117). 
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A religious, perfectionistic woman had a long-standing 
disappointment in her married daughter who drank and 
read “unacceptable” literature. She disapproved of the way 
her daughter was raising her children. The patient asked if 
“being frustrated and never letting anyone know” could 
cause her to “lose” her voice. Later in the interview, she 
came closer to the central issue. “My only daughter annoys 
me constantly, but | never let her know. . . . Where did we 
fail with this daughter?. . . . | can’t reprimand her. | can’t get 
on her back, because it may be something | have done 
that’s wrong. . . . | don’t admire people who show anger. | 
think you should control it. | have been so angry that | have 
felt actually warm inside, but no one has known that | was 
angry. .. . Joy and love are emotions to be expressed, but 
not anger” (Aronson, Peterson, and Litin, 1966, p. 120). 

A 45-year-old married woman has had intermittent 
moments of breathiness for the past 2 years. Laryngologic 
examination shows normal vocal cord structure and func- 
tion. The patient’s voice disorder began during an 
intensely troubled period of her life. Active in a women’s 
service organization and in line for the presidency, she 
was defeated for this office by a false rumor that she had 
misused the organization’s finances. When the patient 
learned of this plot, she was furious and resentful and was 
strongly tempted to express her contempt toward the 
membership, but decided against this course of action 
because she could not bring herself to express the unlady- 
like feelings she felt toward them. 

During this time of anger, frustration, defeat, and vio- 
lent arguments at meetings of the organization, she 
began to detect feelings of tension in her throat followed 









Figure 6.6 Oscillographic tracings. (A) Tracing of 
an abductor spasmodic dysphonic patient of the 
essential tremor type during vowel prolongation. The 
patient has almost continuous tremor. Patient (B) has 
mixed abductor-adductor spasmodic dysphonia of 
the essential tremor type. Note moments of both 
adductor and abductor (intermittent breathy dys- 
phonia) spasms. (From Hartman, D.E., Aronson, A.E. 
[1981]. Clinical investigations of intermittent breathy 
dysphonia. J Speech Hear Disord 46, 428-432.) 


shortly thereafter by a gradual change in her voice. As the 
arguments grew in ferocity, her voice symptoms became 
increasingly disabling. 

One month after the first signs of the voice change, she 
became involved in an extramarital affair. She recalls, “| 
was a sitting duck to get involved.” After 1 year it was dis- 
covered by the man’s wife. “She came over and called me 
every name in the book. | was ashamed, humiliated, furi- 
ous.” Her voice disorder, by now well entrenched, became 
progressively worse after this confrontation. At present, 
she feels tremendously guilty toward her husband, whom 
she describes as “so good and unsuspecting.” She would 
like to tell her husband of the affair but is so “deeply in 
love” with the other man that she cannot give him up. As 
a result, she is caught in a perpetual conflict (Aronson, 
1973, p. 51). 


This discussion of abductor spasmodic dysphonia would 
not be complete without some patient example. Thus, we 
present a verbatim interview with a patient who came to the 
clinic with moderate to severe abductor spasmodic dyspho- 
nia. The etiology remains unclear from the interview and 
examination, and the disorder has to be considered idio- 
pathic until the accumulation of further evidence. The inter- 
view was selected because of the patient's exceptional 
ability to describe the onset and course of his voice disorder, 
its symptoms, efforts toward therapy, and his reactions to 
the disorder. It is also illustrative, rather typically in many 
ways, of the pilgrimage many patients enter into to find a 
cure for their voice disorders. 


Case Study 6.5 


Clinician: Could you tell me now, first of all, your age. 

Patient: I’m 38. 

C: And what is your occupation? 

P: A corporate representative. 

C: Are you married? 

P: Yes. 

C: Do you have any children? 

P: | have three children. 

C: Had you ever had any voice trouble like this before as 
a child or as an adolescent? 

P: No. 

C: Tell me when this first started. 

P: It first started about 6 years ago. | had a very bad cold 
one weekend and when the cold got a little better, | was 
left with laryngitis, and that was the outset of the thing. 
The laryngitis got a little better within about a week, but 
after that it was difficult to say certain words. To begin 
with it was just every so often and just one particular 
sound, which was a hard “c” like “Carol” or “cam,” or a “k” 
sound like in “keep.” It was barely noticeable, but then it 
seemed like each year it got a little bit worse and there 
would be new sounds that would prove more difficult. 

C: What came next? 

P: It seems like it was anything like a “p” or “t” going into 
vowels, and | remember most specifically “Porsche” 
because | was selling foreign cars then. Then it seems like 
maybe “s” and “h” came next. 

C: How long after the laryngitis did the breathy trouble 
start? 

P: Oh, it was the next week. 

C: The laryngitis disappeared. 

P: The laryngitis disappeared but the other didn’t. And 
then it has just gotten progressively worse through the 
years, and it can be a great deal better one day and a great 
deal worse the next. 

C: What kinds of things make it better or worse? 

P: | think the things that make it better are almost com- 
plete relaxation. If | can get into a super relaxed position 
and get my voice into a real low level, then it tends to 
operate fairly well. Many people who might meet me in 
this circumstance might think that | don’t really have any 
voice problem. Unfortunately, the world that we live in 
doesn’t permit us necessarily to operate at this voice 
level. 

C: Are you saying that the louder you talk... 

P: The louder | talk the worse it gets. 

C: Let’s hear you count to five softly and then do it 
loudly. [Patient follows instructions. ] 

C: It does make a difference, doesn’t it? 

P: You were asking what makes it a little better, and | 
think anybody who runs into something like this makes 
compensatory adjustments. | know which words I’m 
going to have difficulty with to a large degree, so | can 
avoid those words. For instance, | know that | can’t say 
the word “hard” so | don’t say it. | say “difficult.” | 
started to say “words that | can say” and it was like a trig- 
ger; you can’t say that, so | didn’t say “say” | said 
pronounce. 


6 Spasmodic Dysphonia 125 





C: Does time of day make it better or worse? 

P: Yes. Usually it is easier early in the morning and it 
gets, depending on what | do during the day, progres- 
sively worse. 

C: If you should go on vacation or be involved in some 
other release from work, will it be better? 

P: To a degree. Very early on the degree might have 
been greater when, say, | went to the lake or something 
and just did nothing but lay on a beach or go fishing. The 
degree that it gets any better is less now than it was very 
early. 

C: Let’s go back to the beginning of this voice disorder. | 
wish you would think about the answer to this question 
very carefully. Before or during the time that you had this 
laryngitis and developed the voice disorder, were you 
under any unusual stress different from what might have 
been under usual life circumstances? 

P: I’m not giving you a quick answer, because | have 
thought about this a lot, because the question has been 
asked a lot. The only thing that | can remember as being 
unusual is, | knew for six months in advance of the voice 
trouble that | was being considered for a promotion. | 
know that | was pushing, because | wanted it, and the 
weekend that the voice disorder came on was when | got 
the promotion. 

C: Were you looking forward to the promotion? 

P: Yes, | wanted the promotion. | was not looking for- 
ward, and | have to say this in all honesty, to the 
prospects of having to work very closely with the owner 
of the company that | would be working directly with. He 
was a very abrasive, intimidating individual, and | knew | 
was going to have to work with him if | got the promo- 
tion. l'II be as honest as | can be; | am sure that gave me 
some apprehension, because he was a very intimidating 
individual. 

C: Well, you’ve left that company since. 

P: Yes, that company went out of business. 

C: How long were you working for this individual before 
that happened? 

P: Three years. 

C: And during that time your voice disorder got gradu- 
ally worse. 

P: Yes. 

C: Did the relationship between you and this man work 
out to be as threatening as you thought it would be? 

P: No. 

C: Was it as traumatic as you had anticipated? 

P: No, it wasn’t. 

C: And so here you are another three years after parting 
company with this individual still with the voice disorder. 

P: Right. 

C: We want to be careful not to assign effects to erro- 
neous causes. 

P: Oh, I understand. 

C: Two things that happened in close proximity don’t 
necessarily mean that they are cause and effect. Did any 
of the psychologists or psychiatrists you had consulted 
attach any importance to the anticipation that you had 
about the promotion as it might relate to the voice 
disorder? 


126 Clinical Voice Disorders 


P: Not any major importance. They didn’t really think 
there was any direct relation there. 

C: And as far as you know there were no other emotion- 
ally upsetting family or business problems. 

P: Not at that point. There have been since then, but not 
at the outset. 

C: And stresses that have happened since. Have they 
worsened your voice? 

P: Well, | don’t know whether they worsened my voice or 
whether it would have gotten worse anyhow. There has 
been a fairly progressive deterioration through the years. 
There probably has not been any deterioration this year. 
Maybe the last two years it has been fairly stable. 

C: How would you honestly assess the effects of the 
voice on your professional, domestic, and social life? 

P: | don’t want to use the word devastating, because 
that seems to be a little too much, but it has had a very 
dramatic effect. 

C: In what way. 

P: For instance, l'Il talk about just little things. When | 
was coming over here today on the bus there was a gen- 
tleman sitting in the seat beside me. | would normally be 
very eager to strike up a conversation with him, but | am 
very reticent [sic] to do that because it is going to be 
embarrassing, so as a result | don’t seek out people. | 
avoid saying anything just in day to day conversation; 
when you go to a restaurant and something is amusing 
and you want to tell somebody about it, | don’t. | don’t 
participate. 

C: Why is that? 

P: Because the words don’t happen. The words don’t 
come out, and it’s embarrassing. 

C: Have you had abnormal reactions from other people? 

P: Not generally. People normally don’t have anything 
to say about it. That is on a very basic level. On a profes- 
sional level, it does affect me, particularly with new cus- 
tomers. | will avoid going to see people whom | should 
go see, customers or potential customers. Even if there 
were no voice problem there would be a certain amount 
of stress involved in going to see these people anyhow 
due to the nature of their business, the size of their busi- 
ness, the nature of the individual, or whatever. Just 
recently | was being considered for a promotion in my 
company, and my boss, quite candidly, said that | would 
not get it as a result of the speech disorder. He told me 
that as a friend as much as anything else, because the 
particular position would involve a great deal of sales 
training, which is getting in front of a group and teach- 
ing them about machinery, which | am good at. I’m as 
good as anybody | have ever seen at sales training, at 
taking a group of people and keeping them interested in 
what they are learning as they sit there an hour or two 
hours. | can do that, and | can do it well, but | can’t do it 
under the circumstances that | am working with now. 
So, instead of looking for opportunities to do that, | look 
for ways to avoid it. In social situations if I’m at a party or 
something, rather than seeking out people to talk to | 
try to get into a corner or get where | can talk with just 
somebody that | know rather than getting into a threat- 
ening situation. 





C: Has the total effect of the voice made you periodi- 
cally or chronically depressed? 

P: Periodically, yes. | would be the first to admit that. 

C: How bad is it? 

P: Here again, sometimes it is worse than others. The 
last two and a half years have added to my voice problem. 
| went into business, which failed. My wife became seri- 
ously ill. So, in answer to your question, do | ever get 
depressed, yes | do but | really make an effort not to. As a 
matter of fact the people in my office, the people that | 
talk with daily over the telephone, they say, “Golly, I’ve 
never known anyone who stays up like you do. How do 
you do that? You just always seem to have something nice 
to say.” I really work at doing that because | don’t want to 
be the other way. 

C: What therapies have been tried from the very begin- 
ning to help the voice? 

P: At the very beginning | went to the regular doctor, 
and he said | had an inflammation and then gave me peni- 
cillin, or something, which did nothing. Then | went to an 
ENT specialist, and he tried a couple of medicines on me, | 
don’t remember what they were, and then he said maybe 
you need to try speech therapy. So | went to the Speech 
and Hearing Center for nine months. 

C: Any benefit? 

P: None. Then several months after that | went back to 
the ENT doctor, and he didn’t really know what to do. 
Then | went to a regular GP, and immediately, he said “I 
think this is something that has an emotional base,” so 
he set up an appointment for me to see a psychiatrist. | 
said, “That’s great. I'll go see a psychiatrist.” So | went to 
see a psychiatrist for about six months, and he said one 
day, “Mr. —, we can keep doing this, but | don’t think I’m 
doing you any good, and | don’t think there is any evi- 
dence of a hysterical reaction, or hysterical conversion. 
There is just nothing that leads me to believe that so | 
really don’t think we ought to continue.” So | didn’t do 
anything there again for a bit. | went to another throat 
specialist, did that for several months. | ended up at 
another GP, and he said, “I think you ought to try hypno- 
sis.” | said, “Okay I’m open. I'll do it.” So he said, “All 
right, let’s do that.” He was trained in hypnosis. | did that 
for about three months. He said, “It just doesn’t seem to 
be doing any good and | don’t understand it. | thought 
that it would.” Another doctor referred me to a second 
speech pathologist. She is the one that first used the 
word “spastic dysphonia.” She said, “There is really noth- 
ing that you can do for spastic dysphonia other than 
speech therapy, and there is a psychologist here who 
specializes in this with hypnotherapy, and | want you to 
go see him.” | said, “All right, I’ve been through it once, 
but maybe | wasn’t at the right place.” | went to the hyp- 
notherapist there and was there once a week for about a 
month, and it just didn’t do anything even though my 
speech was alittle better under hypnosis. | can do that to 
myself just by relaxing, so they weren’t really doing any- 
thing. | went back to the ENT doctor and he said the only 
thing he could do was refer me for speech therapy. I had 
already been through it once but thought maybe | just 
didn’t have the right person, so | went back into speech 


therapy for little over a year, and at the end of about a 
year she helped to a degree. | think she would help any- 
body with speech to a degree, and she did work with me 
on breath control and relaxation techniques and some of 
the compensatory techniques. For instance, like, if you 
can avoid attacking a word, just to roll through a word. If 
| were to say “trouble” then it is difficult if | just attack 
that word “trouble,” but if I’m talking to you about 
“somebody-in-trouble” then it rolls right on in there. So 
she helped me to that degree. 

C: So the learning how to attack the words helped 
somewhat, but it didn’t really make... 

P: No. Then another doctor said, “There is a psycholo- 
gist who has treated spastic dysphonia patients. | spoke 
with her and she thinks she can help you with therapy.” 
I’m off again. That was last year. | had begun to get a little 
bit of information that this thing may have something to 
do with neurology or neurologic condition. 

C: Before you get onto that, you had some more psy- 
chotherapy with the psychologist. Did it benefit? 

P: No. It didn’t help at all. 

C: How long did that go on? 

P: That went on for almost a year. 

C: Then you started... 

P: Then! went to a neurologist just about when I started 
going to that psychologist. He was very honest. He gave 
me a neurologic exam. He said, “I just did not see any evi- 
dence of a neurologic condition, but | will if you are inter- 
ested in doing this be happy to prescribe some medicines 
that are used to treat certain movement disorders, which 
might be of some benefit. If they do any good that is 
great. If they don’t do any good, then we will stop and try 
something else.” 

C: There were several different kinds? 

P: There were about six. 

C: And after that... 

P: And after that | haven’t really been doing anything, 
because | really didn’t know what to do. Then, about six 
months ago | went back to therapy a little bit, we used a 
voice masker, the Edinburgh Voice Masker, and that had a 
little positive effect when | first put it on and then after | 
used it for just a few minutes the effect diminished but 
| felt like maybe if | wear that enough that will help, so | 
ordered the Edinburgh Voice Masker. When | did that | got 
on somebody’s mailing list for stuttering. | got a newslet- 
ter from — University, the stuttering center. So, | wrote, 
and they wrote back and said there is really nothing we 
can suggest other than speech therapy and | am sure 
there is a very good speech pathologist in your area if 
you'll get in touch with somebody. Then, you wrote me 
and here I am. 

C: Did the university people talk to you on the tele- 
phone? 

P: No. 

C: Iwonder if they didn’t think you had the other kind of 
spastic dysphonia. 

P: | would imagine that they did. 

C: You understand there are two kinds and the kind that 
is most familiar to people is the kind in which the vocal 
cords come together excessively. 
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P: The adductor. 

C: They probably thought what you meant by “spastic 
dysphonia” in your letter was the adductor type, but you 
really meant abductor. 

P: Well, | didn’t know that until | talked to you. 


Summary of Abductor Spasmodic Dysphonia 


+ Abductor spasmodic dysphonia is a term that describes 
breathy releases of air owing to sudden hyperabduction 
or loss of tone of the true vocal folds. 


Evidence indicates that these spasms can be due either to 
psychogenic (conversion reaction) or neurologic (essen- 
tial tremor) causes or can be idiopathic. 


+ There is a propensity for the abductor spasms to occur 
most strongly during the production of unvoiced conso- 
nants, to a lesser extent during voiced consonants, and 
least of all during vowels. 


Like the adductor spasmodic dysphonias, onset is gradual 
and often associated with psychologic stress 


Treatment of Abductor Spamodic Dysphonia 


Resistance to therapy is as true for abductor spasmodic dys- 
phonia as it is for the adductor type. The approach to therapy 
should take a form similar to that for adductor spasmodic 
dysphonia. 

First, it is necessary to determine if the disorder is prima- 
rily psychogenic by means of a thorough psychosocial his- 
tory. As in other psychogenic voice disorders, disclosure of 
personal problems may be accompanied by improvement or 
disappearance of the dysphonia. 

Symptomatic voice therapy should always be tried. For 
abductor spasmodic dysphonia specifically, the abductor 
laryngospasms and breathy air release will diminish or dis- 
appear if the clinician teaches the patient to voice all 
unvoiced consonants. For example, saying the phrase “coffee 
cream” will produce severe breathy air releases in most 
abductor spasmodic dysphonic patients. However, if the cli- 
nician then tells the patient to substitute the /g/ for the /k/ 
phoneme in “coffee cream,” the breathy air releases will be 
diminished. In the same way, if the patient is taught to voice 
all of the remaining voiceless consonants, it is possible that 
the abductor spasmodic dysphonia will either diminish con- 
siderably or disappear completely. Telling the patient to keep 
the vocal cords vibrating through all of connected speech 
will accomplish the same objective. 

Similarly, like adductor spasmodic dysphonia, some 
abductor spasmodic dysphonia patients have benefited from 
phonosurgery. Botulinum toxin injections in the cricothyroid 
and/or posterior cricoid arytenoid (PCA) have temporarily 
reduced symptoms in some patients. However, in compari- 
son with patients with adductor spasmodic dysphonia due 
to dystonia, its effectiveness is less pronounced and occurs in 
a smaller percentage of patients (Duffy and Yorkston, 2003). 
Medialization by vocal fold injection or type I bilateral thyro- 
plasties to bring the membranous vocal folds closer together 
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have benefited others. However, there is no consistently suc- 
cessful surgical treatment for this population. 

We close this chapter with a description of the treatment 
journey experienced by a patient with adductor spasmodic 
dysphonia. The treatise is a typical description of experi- 
ences reported by other patients and helps the beginning 
student comprehend the significance of the problem and of 
the difficulties and frustrations faced by the patient. 


+ Confessions of an Adductor Spasmodic 
Dysphonia Patient 


Pre-Botulinum Toxin Injection 


Four and one-half of the past 5 years of my life have been 
spent in bewilderment. “Why do I sound like this?” The last 
6 months have been spent with a certain knowledge of what 
spasmodic dysphonia is and of a possible route that can be 
pursued to periodic relief. 

Joshua Liebman once wrote a book titled Peace of Mind, in 
which he describes the handling of the death of a dear one as 
a prescribed series of stages through which one must pass. 

I felt, as one with spasmodic dysphonia, that I too went 
through similar stages which I refer to as my plateaus. 


Plateaus 


These plateaus ascended and descended in no apparent order. 
At one extreme, the strongest, I felt rage. My Plateau of Rage. 
An inner rage, which at times went out of control, during 
which, in hopeless frustration, and a sinking capability of tol- 
eration, I, as a person who had always prided herself on some 
semblance, at least, of gentility and femininity, wanted to 
smash and destroy all that was in my way to expose my help- 
less anger at not being able to speak and to be understood. 

As the antithesis of this plateau, I passed through my 
Plateau of Peaceful Acceptance during which I felt rather 
intelligent at being able to lead a life of newly acquired pas- 
sive submission. 

I called this plateau, Moving into My Silent World. I substi- 
tuted a normal existence of verbal contact, physical endeav- 
ors, shopping, sociability, meetings, and so forth, with an 
ersatz life consisting of multihours of piano practice daily, 
followed by hours spent writing three different books that I 
assumed would, at least, be bound for family and/or friends. 
The fact that I learned to play piano rather well again or that 
I wrote 200 poems offered no consolation. I had merely filled 
my days with lonely creativity. 

Yet, even here, lack of speech halted my efforts because I 
was unable to communicate with a printer regarding the fin- 
ished form of these writings. At this point I stopped, and the 
writings remained in my desk drawer. Frustration had even 
entered my silent world. 

The most interesting plateau was reached on August 7, 
1989, when the Mayo Clinic informed me that I had a neuro- 
logic, not a psychologic, disorder. Here began my Bravado 
Plateau. 


For a brief period of time | put to rest the agonizing. | 
shifted the gears of my attitudes. I had been told, at last, 
exactly what spasmodic dysphonia was. 

No longer would I need weekly allergy shots for an allergy 
that I never had. No longer the meetings with speech pathol- 
ogists in which I tried hopelessly to emit clear sustained 
sounds of which I was not capable. No longer to be told that 
I used this horrible strangled speech so that people would 
feel sorry for me. I look back, now, to such a remark as insult- 
ing; to imply I could speak clearly if I wished. How sad, when 
it was all beyond my physical control. 

While in my Bravado Plateau my voice problem became 
everyone else’s problem. Not mine. Others would have to 
learn to like my voice as it was what it was. This was, of 
course, my briefest, shortest lived plateau because too soon 
the inability to communicate on the simplest terms returned 
and the ball of speaking-hardship was back in my court. 

Prior to the botulinum toxin injection, I struggled through 
the Plateau of Research and Attempted Comprehension. 
Once I knew that spasmodic dysphonia was a neurologic dis- 
order, I began to read every medical abstract I could find to 
ascertain not only the why of spasmodic dysphonia, but why 
me. 

I read each paper assiduously, laboriously working my way 
through every intricate medical word and sentence with the 
Merck Manual and the American Medical Dictionary close at 
hand. I compiled my own handwritten dictionary filled with 
words like central cranial dysfunction, monopolar electrode, 
cholinergic, and so forth, none of which I completely under- 
stood. 

There is no information as harmful as too little informa- 
tion. This was, gratefully, my last plateau as I realized I must 
have faith with those who wrote the abstracts and who 
would soon help me. 

Yet, as with all other plateaus that had been self-sought 
during my life with spasmodic dysphonia, this one served 
me in fulfilling more silent weeks alone. 


Telephone Communication 


It has been established in the various papers I have read that 
speaking on the telephone is, to say the least, difficult when 
one has spasmodic dysphonia. I found this to be true and I 
felt it was thus not only because I could not articulate with- 
out spasms but because I could not rely on any facial expres- 
sion to help convey the meanings behind my spastic words. 

Communication is a process of interpreting symbols. 
Words are symbols. In any life we misinterpret more than we 
should, perhaps because our deepest intentions are at odds 
with the messages we project. I could project nothing. 

I felt the strongest reactions of pure, sheer terror at the 
sound of the telephone bell. An actual physical distress. As if 
choreographed: When the telephone rang, though someone 
else would answer it for me, my head would immediately, 
automatically, turn from side to side in panic signifying 
“No!” and I could not relax until I felt, with absolute cer- 
tainty, that the telephone would not be handed to me. 

I knew at an intellectual level this fear was not one that I 
admired in myself, yet the ability to act otherwise was quite 
beyond my control—as long as I had the voice of spasmodic 
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dysphonia. In the last 6 months of my 5-year ordeal I devised 
a system for incoming calls from my husband and my daugh- 
ter, both of who called me daily, they would ring 3 times, 
hang up, and re-ring before I would answer the telephone. 
What torment I felt, how filled with sorrow I was, as I stood 
there waiting to count the rings to see if this would be a call 
I was capable of answering. 

All other incoming calls were recorded on the telephone’s 
answering machine. I am at a loss to convey, and will proba- 
bly find it the most difficult to ever forget, the extreme self- 
pity I experience as I stood frozen near the telephone and 
listened to a friend leave a message, realizing I was so lacking 
in confidence that I could only stand there in tears, unable to 
reach forward to pick up the telephone. 

At such times I stood outside of myself and saw a pathetic 
woman whose continual eruptions of tears had become a 
daily part of living in those months before my botulinum 
toxin injection. 


Social Life 


Before I brought all social participation to a halt, I had 
noticed some rather interesting displays of change among 
our female friends. For example, at dinners, when I would be 
forced into silence because of my broken speech, after futile 
attempts at dinner chit-chat, the other women became 
excessively talkative. Dull personalities who previously 
added monosyllables to conversation became strangely 
loquacious, while I sat silently at the table. 

Were other women more nervous than I? Was that the 
reason? Did my feelings of inadequacy make their talk seem 
more emphatic to me than perhaps it really was? Neverthe- 
less, interestingly, it was a consistent pattern among other 
women in my presence. It is mentioned here because it con- 
tributed to my need to abort all social contacts. 

There are many people who feel unsure when offering 
their arm to a blind person who might be waiting at a corner 
to cross a street. Perhaps, in a fashion, people also feel unsure 
in the company of someone with spasmodic dysphonia and 
its obvious audible speech disorder. 

However, my female friends never abandoned me, nor did 
male friends. They always urged me to join them. They called 
about my welfare. They did not care, they said, how badly I 
sounded when I tried to speak. But I did. 

Well-meaning friends would say, “At least you have no 
pain.” True. No physical pain, but, ah, the mental pain and 
grief. The destruction of an entire thinking process added to 
a weary exhaustive frustration. 


Religion 


I have walked out of a sanctuary during religious services 
because I had experienced the beginning of a true physical 
discomfort coursing through my body as | sat there. Such a 
strong negative reaction in a house of God, where one seeks 
peace, mystified me. 

I could not vocally participate in prayer services while 
around me hundreds of people did so with no difficulty. 
Women many, many years older than I responded with 


forceful clear voices. When | tried to respond, people would 
turn to look at me. The humiliation destroyed me. 

I discovered then the world is not necessarily inhabited by 
kind, feeling, sensitive individuals. 


Stress 


I never thought of stress as the primary source of my speech 
difficulty or of its agony, even before being informed of its 
neurologic origin. I thought of stress, then, as minor prob- 
lems with which we all cope daily. I, on the other hand, had 
something that was much more serious (although I did not 
know what it was) than mere stress. 

I became, with the passage of time, incapable of making 
decisions. I experienced a complete loss of assertiveness, of 
interest. My words were spoken in monotone. I perceived my 
voice as a crying, nonexpressive monotone. My speech had 
no nuances. My sound held no laughter. 

Often I felt as if the flow of thought and the flow of speech 
were not meant to become as one. In normal instances, we 
conceive a thought and with unconscious speed we emit the 
words that project the thought from mind to the ears of the 
listener. With spasmodic dysphonia, a different “process” 
aborted this continuation of my thought/speech pattern. 
This “process” blinked out a quick message to me some- 
where between thought and speech. This “process” said, 
“Why bother? It will not come out right. It will not be under- 
stood.” I could have filled volumes with my thoughts never 
spoken. 

I developed, quite without thinking about it, an involun- 
tary movement with my hand. As I would begin any sen- 
tence, to a stranger or to a store clerk, for example, my hand 
would automatically fly to my throat with the very first word 
I uttered, and I would explain, “I have laryngitis.” Or “I have 
an allergy.” And, worse yet, “I’m sorry.” Always an apology 
for the sound of my voice when | knew, full well, the other 
person did not know me, could not care less, had never seen 
me before, would possibly never see me again. But that hand 
flew to my throat when I tried to speak, from the moment I 
left my apartment until I returned. 


Physical Attitude 


I had always walked very straight and erect. Proud, actually, 
of my good posture. When I sat, as a rule, I did so most com- 
fortably with my back seldom touching the back of a chair. 
I attributed this, and a self-competitive physical attitude, to 
16 years of ballet training and performance. 

However, with my dysphonia, although in these instances 
speech was not involved, I realized I had begun to slump, 
round shouldered, curved spine. Worse, I did not care. It didn’t 
matter. I would shrug my shoulders to imply lack of interest. 

Before the dysphonia took over my life with its cruel force, 
I would swim 60 laps every day. In addition I would jog 4 
miles every day. I gradually became too tired to walk for only 
10 minutes. To walk and to try to speak at the same time 
became impossible without experiencing fatigue. So then 
even an activity that I could have done in relative solitude 
was restricted to me, becoming, as the dysphonia became 
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worse, a nonaccomplished effort. I finally submitted and 
gave up, thus contributing to my further helplessness and 
demoralization. 

I had always been in competition with myself where phys- 
ical endurance was concerned. If I did six of anything today, I 
must do eight tomorrow. This was pleasurable to me, an 
achievement, a satisfaction. Soon gone. 

Not only the sense of competitive accomplishment was 
withdrawn but simple maintenance, as well, could not be 
achieved. Pleasures became impossible, replaced by hope- 
lessness. 

I lost the desire to strive. 


Tolerance Levels 


My tolerance dropped, with the passage of time, in direct 
proportion to the increasing frequency of spasms in my 
speech. All external forces became extremely objectionable. 
It was too noisy, too dark, too light, too long, too short, too 
fast, too slow. 

I had to exert the effort to work at exercising patience for 
nearly everything, seldom succeeding. My tolerance was 
always in simultaneous accord with the way I perceived my 
spastic voice. Life was filled with irritants. I could not help 
myself. My dysphonia had taken me out of the bounds of 
ordinary daily acceptance. 


Familial Significance 


Some additional insight into the psyche of this patient with 
spasmodic dysphonia may be found in the following 
excerpts from a letter that I wrote to my husband. This letter 
had been given to him as I walked into the room for my bot- 
ulinum toxin injection. In part, it reads: 

“These have been the unhappiest 5 years of my life. This 
past month has been the worst of all. I have fought depres- 
sion daily. I have done battle with it and I was able to hold 
myself, at least, to being merely sad, somber, and with- 
drawn.” 

“Sometimes the rage within me exploded out of my sys- 
tem when the frustration of living the life of a recluse 
became more than I could handle.” 

“I don’t know who I am (or was) anymore. I tried to move 
into my ‘silent world’ like a lady. But it’s been hard. I’m not 
sure if I made it, but know I tried.” 

“Trying to speak to you has been such an effort, so tiring. 
But my heart has always been filled with appreciation. I want 
you to know how much I have always appreciated the over- 
whelming patience that you have had for me through these 
past few years.” 

Through these past 5 years, from time to time, through the 
cheerfulness of one daughter who called me daily, I did find 
the only touches of humor that sustained me. She brought 
this gift of humor, which came to me like flashes of clear, 
bright light into my otherwise darkened life. 

Without these brief daily moments of shared laughter, 
which began and ended with her contact, the years would 
have been still more intolerable. 

I could not have struggled in a world where people did not 
understand me when I spoke unless a corner of my brain 


could send a signal to me that implied there can be some 
laughter along with the tears. 

I am indebted to my daughter for lifting my spirits as I am 
to my husband for his sustained love and patience. Without 
each I was never quite sure that I would have endured as 
well. 

As is obvious here, I gave myself little credit for any inborn 
stamina or fortitude. With spasmodic dysphonia I was 
drowning, sinking, and crying for help. Under such circum- 
stances, self-motivation is virtually nonexistent, or indeed, 
very limited. 

In closing, | must establish that all that which has been 
written on these pages has been written within the time 
frame of 7 days following my first botulinum toxin injection, 
in which the anguished memories of the past 5 years are still 
so near the surface. 

In addition, my interpretation of how I personally per- 
ceive, or analyze, the past 5-year period of dysphonia must 
be accepted as those of a woman who had always felt, prior 
to that 5-year period, all of life’s happenings as strong gusts 
of maximum extremes. Whenever possible, of optimum 
extremes. 

My inability to maintain my self-dignity in which I had 
always felt great pride, in retrospect, was the saddest part of 
my life with spasmodic dysphonia. 


Post-Botulinum Toxin Injection 


What wonderment! 

I am worthwhile again! 

The spasms and vocal breaks are no longer heard. It is 
quite incredible that my entire thinking processes and atti- 
tudes have changed so quickly. This is almost, but not nearly, 
to me the miracle of receiving the botulinum toxin injection. 

Not only had my speaking voice cleared, but in the same 
24-hour period my brain and my thinking have also cleared. 

I am a rocket! I have accelerated to a height that I had for- 
gotten existed. How can this phenomenon occur so swiftly? 
What new and good plateaus await me? 

I had forgotten the sound of my own laughter. I had forgot- 
ten the sheer delight of speaking one-syllable words without 
breaks. I have reverted to the woman I was more than 5 years 
ago. 

I remember me! So soon! So soon! How can this be? I 
recall now that I was a happy woman, upbeat, joyous. 

I expected voice improvement. I anticipated a certain 
pleasure to replace the sorrow with which I have lived for 
such a long time. But this? This is unbelievable! 

How else to explain the overwhelming exhilaration of 
being able to do something as basic as ordering a cup of tea 
in a restaurant. For myself. By myself. For the first time in 
years to speak in a clear voice. No longer a waitress to tell me 
she cannot understand me. No longer strangers at nearby 
tables turning to stare at me. 

I have read that with spasmodic dysphonia, not only does 
a voice change but an entire personality can change. With 
this fascinating quick rebound that I am experiencing, I won- 
der about that. 

Perhaps the personality is laid dormant, at rest, eager to 
spring to life anew given the opportunity. 
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I often cried, “I do not know who I am or even who I was 
before the dysphonia.” I could not recall my persona, so sub- 
merged was I in noncommunicative despair. 

Yet, within 24 hours I have become the person I had always 
been, the woman I had completely forgotten. 

For this voice, this life, this clarity of thought, I am 
enveloped in elation. 


Conclusion 


If I am to benefit from this introspective interpretation of 
myself as a woman with spasmodic dysphonia, both before 
and after the botulinum toxin injection, would I, could I, 
handle it differently the next time around? 
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Chapter 7 


Clinical Voice Evaluation 


Nathan V. Welham 


Voice disorders are inherently complex, and their evaluation 
should reflect this. The voice can function as a barometer of 
physical and emotional health; it is closely tied to personal 
and professional identity; and it is a powerful expressive 
communication tool. A comprehensive evaluation of vocal 
function requires a carefully elicited case history, considera- 
tion of psychosocial impact, and a complete assessment of 
laryngeal and vocal fold vibratory function. The clinician 
should hold a strong knowledge base in laryngeal anatomy 
and physiology and be able to integrate assessment data 
within this knowledge framework. The tools available to the 
contemporary clinician range from his or her auditory and 
visual perceptual systems, to acoustic, aerodynamic, and 
endoscopic technologies. In addition, information from the 
entire voice care team, including a range of medical data 
from other disciplines (e.g., asthma-allergy, endocrinology, 
gastroenterology, neurology, oncology, psychiatry, pul- 
monology, radiology), is often critical in piecing together a 
complete clinical picture. 


+ Case History Taking 


No voice evaluation is complete without a comprehensive 
case history. The case history assists in determining the path 
of assessment, points toward potential differential diag- 
noses, suggests prognosis, and helps to direct treatment. 
Given this, a successful case history requires a high level of 
clinical skill. The clinician must be able to efficiently and 
expertly judge the significance of certain information, 
remain unbiased by other specialist opinions, know when to 
pause and listen, and when to probe deeper for clarification 
or detail. The path of questioning and exploration followed 
by any given clinician is as varied as the presentation of any 
given patient. Whereas the clinician brings a depth of clinical 
knowledge and experience to the situation, the patient 
brings his or her personal expertise concerning the history 
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and current status of the presenting voice problem, its impli- 
cations for daily functioning, and its influences on the 
patient as a person. 

It is important to note that as the voice is influenced by 
physical, emotional, and personality states, all of these 
dimensions must be probed in the case history. During the 
interview and voice evaluation, all plausible etiologies 
should be entertained. Viewing the presenting problem as 
an exclusively laryngeal phenomenon can potentially result 
in misdiagnosis; therefore, the clinician should always con- 
sider the possibility of the voice representing a more general 
disorder of the speech mechanism or entire organism. 

The following section details key content areas within a 
typical voice case history. This section does not address 
every conceivable history factor relevant to voice but rather 
covers common issues encountered in the clinic. Of course, it 
is impossible to master the art of clinical interviewing simply 
by reading about it, as every clinician’s skills are honed with 
ongoing training and experience. The subsections and order 
presented here are not inviolable; however, beginning clini- 
cians may benefit from the use of a checklist. 


Key Components 
Familiarization and Rapport Building 


The clinician should strive to facilitate a positive interper- 
sonal environment and relationship with the patient. From 
early impressions, the clinician perceives the patient’s 
reaction to a new situation and may find signs of tension, 
anxiety, or depression through facial expression, posture, 
gestures, and handshake. At the same time, the patient 
makes important judgments about the clinician: how 
interested the clinician is in seeing the patient; how secure, 
at ease, and skillful the clinician is in the interpersonal sit- 
uation; in short, the kind of person with whom the patient 
is dealing. If confidence and a sense of trust are established 
early, the patient will find it easier to discuss his or her per- 
sonal history. 
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Nature, Onset, and Course of the Voice Disorder 


Nature An often helpful starting point in the clinical inter- 
view process is to obtain the patient’s description of the pre- 
senting problem. The patient’s response to an inquiry of this 
nature may demonstrate insight, motivation, confusion, or 
misunderstanding. Some patients will simply recast the 
comments of the referring specialist, whereas others may 
express dissatisfaction with their current diagnosis. Regard- 
less of the eventual accuracy of the patient’s perception, it 
does have clinical significance. If later intervention is to be 
effective, misconceptions must be directly addressed and the 
patient’s statement of a primary symptom considered a 
treatment priority. 


Onset Determining the onset of any voice disorder is 
important in guiding differential diagnosis. Many patients 
find it difficult to establish the precise onset of voice symp- 
toms because most disorders begin gradually rather than 
suddenly. Patients who give a vague answer should be 
pressed to recall the month, day, or even hour when their 
symptoms first began. Voice disorders associated with mus- 
culoskeletal tension, slowly developing vocal fold mass 
lesions, and progressive neurologic disease tend to develop 
gradually. Those associated with acute vocal fold lesions 
(e.g., a hemorrhagic polyp), acute neurologic incidents, and 
psychogenesis tend to occur abruptly. 

It is important to identify significant factors or events 
associated with the onset of voice symptoms. The patient 
may associate the onset of symptoms with a significant life 
event, a period of emotional stress, or an upper respiratory 
infection. Such reports may be indicative of an underlying 
psychogenic component (an upper respiratory infection 
with associated laryngitis can provide a psychogenic trigger 
for dysphonia); however, it is important to explore these 
reports carefully, and interpret them within the context of 
the entire voice evaluation. Many patients with psychogenic 
voice disorders are notoriously unaware of causative emo- 
tional stress or interpersonal conflict. Raising these issues 
and extracting key information within the context of a clini- 
cal interview and later helping patients to understand the 
link between these issues and their voice disorder requires 
counseling skills. Because of its indispensability in the evalu- 
ation and treatment of voice disorders, a separate chapter on 
psychologic interviewing and counseling is included later in 
this book. 

Voice symptoms that first appeared alongside speech and 
swallowing difficulties may be due to a neurologic impair- 
ment. The clinician should be alerted to reports of imprecise 
and effortful speech production, hypernasality, difficulty 
with chewing and swallowing, and nasal regurgitation. 
Weight loss should also be noted. 


Course It is useful to gauge the degree of variability in 
symptoms since the onset of the voice disorder, as well as 
across a typical day. Patients may describe their symptoms 
as consistent, intermittent and marked by periods of nor- 
mal voice, gradually worsening or improving over time, or 
completely unpredictable from moment to moment. 
Patients with nonorganic voice disorders due to muscu- 


loskeletal tension or psychogenesis will often report peri- 
ods lasting from minutes to days when their voices are 
completely normal, only to worsen again. In contrast, 
patients with organic voice disorders due to either vocal 
fold mass lesions or neurologic disease will usually report 
continuously abnormal voice from time of onset, either 
consistent in severity or deteriorating gradually or precipi- 
tously with time. Myasthenia gravis is characterized by 
gradual voice deterioration and fatigue during speech anda 
reversal of these symptoms after rest. Laryngopharyngeal 
reflux may be indicated by intensified voice symptoms dur- 
ing the first few hours of the day, a result of sleeping in the 
prone position overnight. 

A cautionary note should be made concerning variability 
in symptoms. All voice disorders, regardless of etiology, gen- 
erally worsen during physical fatigue and emotional stress. 
The clinician should not automatically link these variations 
with a psychogenic etiology. Further, patient descriptions of 
periods of normal voice production should be questioned 
carefully. Many patients describe as normal what later 
proves to only be an episode of improvement. 


Associated Factors 


It is important to identify factors associated with the pres- 
ence of voice symptoms, as well as factors that tend to exac- 
erbate or ameliorate symptoms. Reports of fatigue, globus 
sensation, tightness and pain in the laryngeal region, 
increased coughing, sensations of irritation and dryness, 
and heartburn should all be noted. The patient may notice 
an improvement or worsening of symptoms as a result of 
various factors. Examples are changes in vocal demand, sea- 
sonal allergies, climate, working environment, and eating 
habits. 


Related Health and Medical Issues 


Several general health and medical issues emphatically per- 
tain to the voice. The clinician should enquire whether the 
patient has any diagnosed respiratory disorders (e.g., chronic 
obstructive pulmonary disease, asthma), gastrointestinal 
disorders (e.g., gastroesophageal reflux disease), neurologic 
disorders (e.g., stroke, traumatic brain injury, pseudobulbar 
palsy, essential tremor, dystonia, Parkinson’s disease, multi- 
ple sclerosis), allergic rhinitis or chronic sinusitis, autoim- 
mune disorders (e.g., rheumatoid arthritis, systemic lupus), 
endocrine disorders (e.g., hyperthyroidism, hypothy- 
roidism), psychiatric disorders (e.g., clinical depression, 
acute or chronic anxiety, schizophrenia), or hearing loss. A 
description of the current medical and pharmacologic man- 
agement of these conditions should be obtained. Previous 
vocal fold, laryngeal framework, head and neck, thoracic, or 
cardiac surgery should be detailed. It is important to deter- 
mine whether the patient was intubated during any such 
procedure and whether radiation therapy was employed in 
the head and neck region. Other laryngeal trauma, such as 
from a physical wound or chemical inhalation, should also be 
noted. 

Excessive alcohol, caffeine, tobacco, and illicit drug use 
is an important area of inquiry and should not be avoided 
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by the clinician. Many substances can have a negative 
influence on the vocal fold epithelium and lamina propria, 
and most patients are unaware that the abuse of these 
substances can exacerbate their voice disorders. When 
questioning the patient, it is helpful to establish both the 
quantity and frequency of substance consumption. The 
discussion of alcohol and illicit drug use may be a particu- 
larly sensitive issue with some patients: those who appear 
reluctant to divulge information should be assured of 
patient-clinician confidentiality. 


Voice Use 


Excessive and inappropriate voice use can be a significant 
factor both in the onset and exacerbation of a voice disorder 
due to the presumed role of excessive vocal fold impact 
stress in vocal fold tissue injury (Gray, Hammond, and Han- 
son, 1995; Gunter, 2004; Titze, 1994b). The clinician should 
explore the nature and degree of voice use across all 
domains of the patient’s life. Vocational demands can be crit- 
ical if the patient gives a large number of presentations, 
spends a significant amount of time on the telephone, in 
meetings or other group Situations, is a performer, or works 
in a noisy environment. Recreational activities such as ama- 
teur or professional singing or acting, attending sports 
events and concerts, cheerleading, or coaching can all tax the 
vocal mechanism. When questioning patients about voice 
use, it is helpful to determine the environment in which the 
voice is being used (indoor, outdoor, air conditioned, humid- 
ified, smoky, noisy, hot, cold), whether aids such as amplifi- 
cation are available and used, and whether the patient has 
received voice training. 


Voice Handicap Index (VHI), Henry Ford Hospital 


Assessment of Psychosocial Impact 


The psychosocial impact of dysphonia on a patient’s daily 
activities and quality of life is a very important area of con- 
sideration within overall assessment. It is intuitive that the 
impact of dysphonia on any individual is the result of per- 
sonal and professional voice demands, the role of voice in an 
individual’s concept of self, and an individual’s ability to 
manage, cope with, and compensate for disease. In an 
attempt to accurately measure the psychosocial impact of 
dysphonia, as well as the functional effectiveness of medical, 
surgical, and behavioral interventions, several instruments 
have been developed, most of which are founded on the 
International Classification of Functioning, Disability and 
Health (World Health Organization, 2001) or its earlier ver- 
sion, the International Classification of Impairment, Disabil- 
ity and Handicap (World Health Organization, 1980). 

The Voice Handicap Index (VHI; Jacobson, Johnson, Gry- 
walski, et al., 1997) is a 30-item instrument designed to 
examine emotional, physical, and functional responses to 
dysphonia (Fig. 7.1). The emotional domain measures emo- 
tional response, the physical domain measures perceived 
vocal function and laryngeal discomfort, and the functional 
domain measures the impact of dysphonia on daily activi- 
ties. The VHI has been validated and data have been pub- 
lished from several patient populations (Jacobson, Johnson, 
Grywalski, et al., 1997; Rosen and Murry, 2000; Rosen, 
Murry, Zinn, et al., 2000). An alternative tool, the Voice- 
Related Quality of Life Measure (V-RQOL; Hogikyan and 
Sethuraman, 1999), focuses specifically on quality of life 
changes associated with dysphonia and contains 10 items in 
two domains (social-emotional and physical functioning) 
(Fig. 7.2). The V-RQOL instrument has been used to measure 








Instructions: These are statements that many people have used to 
describe their voices and the effects of their voices on their lives. 
Circle the response that indicates how frequently you have the 
same experience. 


F1. My voice makes it difficult for people to hear me. 

P2. Irun out of air when | talk. 

F3. People have difficulty understanding me in a noisy room. 

P4, The sound of my voice varies throughout the day. 

F5. My family has difficulty hearing me when | call them 
throughout the house. 

F6. | use the phone less often than ! would like. 

E7. I'm tense when talking with others because of my voice. 

F8. | tend to avoid groups of people because of my voice. 

E9. People seem irritated with my voice. 

P10. People ask, "What's wrong with your voice?” 

F11. | speak with friends, neighbors, or relatives less often 
because of my voice. 

F12. People ask me to repeat myself when speaking face-to-face. 

P13. My voice sounds creaky and dry. 

P14. | feel as though | have to strain to produce voice. 


Figure 7.1 


E15. I find other people don't understand my voice problem. 
F16. My voice difficulties restrict my personal and social life. 
P17. The clarity of my voice is unpredictable. 

P18. | try to change my voice to sound different. 

F19. | feel left out of conversations because of my voice. 
P20. Iuse a great deal of effort to speak. 

P21. My voice is worse in the evening. 

F22. My voice problem causes me to lose income. 

E23. My voice problem upsets me. 

E24. Iam less outgoing because of my voice problem. 

E25. My voice makes me feel handicapped. 

P26. My voice “gives out" on me in the middle of speaking. 
E27. | feel annoyed when people ask me to repeat. 

E28. | feel embarrassed when people ask me to repeat. 
E29. My voice makes me feel incompetent. 

E30. I'm ashamed of my voice problem. 





Note. The letter preceding each item number corresponds to the 
subscale (E = emotional subscale, F = functional subscale, P = 
physical subscale). 


The Voice Handicap Index (VHI). (From Jacobson, B. H., Johnson, A., Grywalski, C., Silbergleit, A., Jacob- 


son, G., Benninger, M. S., et al., [1997]. The Voice Handicap Index [VHI]: development and validation. Am | Speech 
Lang Pathol 6, 66-70. ©1997 The American Speech-Language-Hearing Association. Reprinted with permission.) 
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VOICE-RELATED QUALITY OF LIFE (V-RQOL) MEASURE 
UNIVERSITY OF MICHIGAN 


NAME: 


DATE: 


We are trying to learn more about how a voice problem can interfere with your day to day activities. On this paper, 
you will find a list of possible voice-related problems. Please answer all questions based upon what your voice has 
been like over the past two weeks. There is no “right” or “wrong” answers. 

Considering both how severe the problem is when you get it, and how frequently it happens, please rate each item 
below on how “bad” it is (that is, the amount of each problem that you have). Use the following scale for rating 


the amount of the problem: 


Because of my voice, 


. I have trouble speaking loudly or being heard in noisy situations. 


1 = None, not a problem 

2 = A small amount 

3 = A moderate (medium) amount 
4=A lot 

5 = Problem is as “bad as it can be” 


How much of a problem is this? 


. I run out of air and need to take frequent breaths when talking 

. I sometimes do not know what will come out when I begin speaking. 
. I am sometimes anxious or frustrated (because of my voice) 

. I sometimes get depressed (because of my voice) 


. I have trouble doing my job or practicing my profession (because of my voice) 


. I avoid going out socially (because of my voice) 
. I have to repeat myself to be understood. 
. I have become less outgoing (because of my voice) 


1 

2 

3 

4 

5 

6. I have trouble using the telephone (because of my voice) 
7 

8 

9 
10. 


Computational Algorithm 


V-RQOL General Scoring Algorithm 


100 — 


Example for Total Score 
If a Raw Score is 20 


(10) 


100 — 
(40) 


(Raw Score — # items in domain or total) 
(Highest Possible Raw Score — # items) 


pet ne te t pt pt 
NNNNNNBPIERPINN 
WWW WW WWW WwW 
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x 100 


100] = 100 — (0.25 x 100) = 100 — 25 = 75 Standard Score 


Figure 7.2 The Voice-Related Quality of Life (V-RQOL) measure. (From Murry, T., Medrado, R., Hogikyan, N. D., Aviv, 
J. E. [2004]. The relationship between ratings of voice quality and quality of life measures. | Voice 18, 183-192. © 2004 


The Voice Foundation. Reprinted with permission.) 


clinical outcomes in patients with unilateral vocal fold 
paralysis (Hogikyan, Wodchis, Terrell, et al., 2000) and 
adductor spasmodic dysphonia (Hogikyan, Wodchis, Spak, 
et al, 2001; Rubin, Wodchis, Spak, et al, 2004). It also 
appears to be moderately correlated with clinician ratings 
of voice quality (Murry, Medrado, Hogikyan, et al, 2004). Ma 
and Yiu (2001) developed the Voice Activity and Participa- 
tion Profile (VAPP) in response to evolution in the World 
Health Organization’s classification system for impairment, 
disability, and handicap. This 28-item instrument assesses 
activity limitation and reduced willingness to participate 
in voice-related activities and spans five domains: self- 
perceived severity job, daily communication, social commu- 
nication, and emotion. The VAPP has been validated and 
demonstrates good reliability. 

A small number of instruments have been developed for 
outcomes measurement in specific patient populations. 
The Voice Outcome Survey (VOS; Gliklich, Glovsky, and 
Montgomery, 1999) contains five questions designed to 


measure daily activity and quality of life changes associ- 
ated with unilateral vocal fold paralysis. The Pediatric 
Voice Outcome Survey (Hartnick, 2002; Hartnick, Volk, and 
Cunningham, 2003) is specifically targeted toward the 
pediatric voice patient and is intended to be completed by 
caregiver proxy. Both of these devices have undergone ini- 
tial validation. 


¢ Auditory Perceptual Analysis 


Auditory perceptual analysis refers to the use of the human 
auditory perceptual system, often in combination with an 
external rating system, to make judgments of the nature 
and appropriateness of an individual’s voice pitch, loud- 
ness, and quality. Some suggest the ability to relate percep- 
tual judgments to the origin of dysphonias is a kind of art 
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Equal Appearing Interval 


0 l 2 3 4 5 6 


Figure 7.3 Equal appearing interval and visual analog scales. 


No breathiness | | | | | | Extreme breathiness 


Visual Analog 


No breathiness | | Extreme breathiness 


form not unlike a musician recognizing a chord from a 
Beethoven symphony or a physician’s recognition of a type 
of pathologic heartbeat or breath sound discerned from 
auscultation. Although recognized as an important clinical 
and research tool, perceptual analysis (especially the analy- 
sis of quality) has been characterized by questionable valid- 
ity and poor reliability. It is often unclear which descriptors 
are the most meaningful in representing both normal and 
pathologic variations in voice quality. Beyond this, meas- 
urement and scaling issues can add further analysis error. 
Reliability issues have been traditionally associated with 
such instrument or external task factors; however, it 
appears that the inherent ability of judges to make reliable 
judgments must also be questioned (Gerratt, Kreiman, 
Antonanzas-Barroso, et al, 1993; Kreiman, Gerratt, Kemp- 
ster, et al., 1993). 

Regardless of the critical problems that plague perceptual 
judgments of voice, this analysis modality holds clear face 
validity and functional importance. Perceptual voice 
changes can indicate the presence of vocal pathology and 
often direct individuals to seek evaluation and treatment. 
Perceptual judgments are also critical to a patient’s evalua- 
tion of change after treatment, as well as acceptance by sig- 
nificant others. Further, despite its apparent unreliability, 
the human auditory perceptual system allows complex and 
sophisticated judgments of quality, far superior in many 
ways to the algorithms that underlie acoustic indices. The 
human auditory perceptual system can process a noisy 
signal (such as a severely dysphonic voice) with more suc- 
cess than most artificial algorithms (Rabinov, Kreiman, 
Gerratt, et al, 1995). In some ways, the ability of the auditory 
perceptual system to make complex and sophisticated judg- 
ments is both an advantage and drawback, as judges (partic- 
ularly experienced judges) appear to assign differing impor- 
tance to different signal parameters, depending on the 
nature of the signal (Kreiman, Gerratt, and Precoda, 1990). 
This may be what manifests as poor reliability among 
judges. 

Contemporary theory (Kreiman, Gerratt, Kempster, et al., 
1993), supported by data from a programmatic series of 
studies (Gerratt and Kreiman, 2001; Gerratt, Kreiman, Anto- 
nanzas-Barroso, et al, 1993; Kreiman and Gerratt, 1998; 
Kreiman and Gerratt, 2000; Kreiman, Gerratt, and Precoda, 
1990; Kreiman, Gerratt, Precoda, et al, 1992), suggests that 
individuals make voice quality judgments with respect to an 


inherently unstable internal standard, established according 
to rating experience and exposure to different (normal and 
pathologic) voices. Voice quality is perceived and processed 
in a sophisticated multidimensional manner, which is gener- 
ally problematic for rating systems based on unidimensional 
scales of specific constructs (e.g., breathiness, roughness). 
Further, alternative scaling systems may vary in their ability 
to accurately capture and represent an individual’s judgment 
(Fig. 7.3). Equal appearing interval (EAI) scales (which 
require selecting a value from a series of equally graduated 
values) allow a discrete response, but if too coarse, may not 
capture fine perceptual distinctions. Visual analog scales 
(which require placing a mark on a line with labeled 
extremes) allow finer perceptual ratings, but if too fine, risk 
introducing meaningless distinctions. These individual 
judge and measurement task factors should be carefully 
considered in the design or selection of any perceptual pro- 
tocol for clinical application. 


Traditional Perceptual Analysis of Voice Protocols 


Several perceptual analysis protocols have been developed 
for the evaluation of voice. Several popular protocols, all 
developed in different countries, are introduced here. The 
GRBAS protocol, developed by the Japanese Society of Logo- 
pedics and Phoniatrics, contains five 4-point EAI scales: 
grade (overall severity), roughness, breathiness, asthenia 
(lack of vocal power), and strain (Hirano, 1981). Each param- 
eter may be rated as 0 (normal), 1 (slightly impaired), 2 
(moderately impaired), or 3 (extremely impaired). A training 
tape is available containing examples of each parameter and 
various parameter combinations. Developed in the United 
States, the Buffalo III Voice Profile employs 12 5-point EAI 
scales spanning both auditory perceptual and other vari- 
ables: laryngeal tone, pitch, loudness, nasal resonance, oral 
resonance, breath supply, muscles (hyper- or hypotension), 
voice abuse, rate, speech anxiety, speech intelligibility, and 
overall voice rating (Wilson, 1987). Additional subprofiles 
are provided for specific populations/disorders. The Stock- 
holm Voice Evaluation Approach (SVEA), developed in Swe- 
den, contains 13 parameters: aphonia/intermittent aphonia, 
breathiness, hyperfunction/tension, hypofunction/laxness, 
vocal fry/creakiness, roughness, gratings/scrapiness, unstable 
quality/pitch, voice breaks, diplophonia, modal/falsetto regis- 
ter, pitch, and loudness (Hammarberg, 2000; Hammarberg 
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Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) 


Name: 


The following parameters of voice quality will be rated upon completion of the following tasks: 


1. Sustained vowels, /a/ and /i/ for 3-5 seconds duration each. 
2. Sentence production: 

a. The blue spot is on the key again. 

b. How hard did he hit him? 

c. We were away a year ago. 


d. We eat eggs every Easter. 
e. My mama makes lemon muffins. 
f. Peter will keep at the peak. 


Figure 7.4 The Consensus Auditory 
Perceptual Evaluation of Voice (CAPE- 
V) rating form. Drafted in 2003, this 
tool for the clinical auditory-percep- 
tual assessment of voice is currently 
undergoing field testing. [© 2003] The 
American Speech-Language-Hearing 
Association, Special Interest Division 3, 
Voice and Voice Disorders. Reprinted 
with permission.) 


Date: 


3. Spontaneous speech in response to: "Tell me about your voice problem." or "Tell me how your voice is functioning.” 


Legend: C = Consistent I = Intermittent 
MI = Mildly Deviant 


MO =Moderately Deviant 
SE = Severely Deviant 





Overall Severity 


Roughness 

Breathiness 

Strain 

Pitch (Indicate the nature of the abnormality): 
Loudness (Indicate the nature of the abnormality): 





COMMENTS ABOUT RESONANCE: NORMAL 


SCORE 
C I /100 
C qJ /100 
O | /100 
C I /100 
C I __/100 
CI /100 
C -I /100 
C I /100 


OTHER (Provide description): 


ADDITIONAL FEATURES (for example, diplophonia, fry, falsetto, asthenia, aphonia, pitch instability, tremor, 


wet/gurgly, or other relevant terms): 


Clinician: 


and Gauffin, 1995). Developed in the United Kingdom, the 
Vocal Profile Analysis (VPA) scheme is an extensive, phoneti- 
cally based rating protocol containing more than 30 parame- 
ters, including vocal tract and prosodic features (Laver, 1980; 
Laver, 2000). 

Regardless of the rating paradigm employed, the clinician 
should begin attending to the perceptual characteristics of a 
patient’s voice immediately on the initial greeting and con- 
tinue throughout the case history and various assessment 
tasks. It is important to gauge both perceptual features as 
well as the appropriateness of these features for the patient’s 
sex, age, and cultural orientation. In addition, the clinician 
should note variation associated with particular assessment 
tasks or trial therapy. Perceptual voice changes under certain 


conditions may give invaluable insight into the etiology of a 
condition, as well as prognosis for treatment. 


Recent Approaches to Perceptual Analysis of Voice 


Several recent approaches to the perceptual analysis of voice 
demonstrate promise in overcoming (or at least curtailing) 
the theoretical and practical challenges that are inherent to 
this area. In 2002, a consensus meeting convened by the 
American Speech-Language-Hearing Association’s (ASHA) 
Special Interest Division on Voice and Voice Disorders 
resulted in the development of a pilot assessment instru- 
ment, named the Consensus Auditory Perceptual Evalua- 
tion of Voice (CAPE-V) (Fig. 7.4). The CAPE-V represents the 
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consensus committee’s minimum recommended standard 
for the auditory perceptual analysis of voice disorders and is 
intended to facilitate increased consistency across individu- 
als in clinical practice. The CAPE-V instrument contains six 
primary perceptual parameters (overall severity, roughness, 
breathiness, strain, pitch, loudness), which are rated using 
100-mm visual analog scales and judged as either consis- 
tently or intermittently present. Ratings are designed to be 
completed after the production of isolated vowels, sen- 
tences, and a connected speech sample. Scales are provided 
for rating any additional prominent voice parameters, and 
space is provided for describing resonance and other notable 
features. At present, the CAPE-V instrument is undergoing 
field testing, and experimental data on its validity and relia- 
bility are forthcoming. Future plans for this instrument 
include the availability of perceptual referents (anchors) and 
the development of a training system. 

Two recent approaches directly address the presumed vari- 
ability and instability of the internal standards used by raters 
in making perceptual judgments. The first approach involves 
the use of external standards, or anchors, placed at discrete 
points ona rating scale. By substituting a rater’s internal stan- 
dards with these explicit external standards, rating reliability 
is significantly improved (Chan and Yiu, 2002; Gerratt, 
Kreiman, Antonanzas-Barroso, et al., 1993). The second 
approach, named analysis by synthesis, requires judges to 
manipulate voice synthesis parameters to obtain an auditory 
perceptual match with a target sample (Gerratt and 
Kreiman, 2001). These explicit comparisons bypass the need 
for internal standard comparisons, also resulting in enhanced 
reliability across judges. 


+ Acoustic Analysis 


Acoustic analysis of vocal function is widely employed in 
clinical and research settings. By carefully examining sound 
signals radiated from the mouth, considerable information 
can be inferred regarding underlying laryngeal physiology. 
Acoustic studies are typically performed using either live or 
recorded voice samples and are therefore noninvasive. 

Whereas the potential benefits of acoustic analysis are 
many, it is important that the application and interpretation 
of these tools is based on a solid conceptualization of rele- 
vant theory. Clinicians should hold at least a rudimentary 
knowledge of source-filter theory and principles of digital 
signal processing, as well as the algorithmic makeup and 
potential susceptibility to error of specific analysis tools. By 
maintaining a balanced and informed perspective across 
these areas, clinicians will obtain the most benefit from 
acoustic instrumentation. 


Underlying Principles 
Source-Filter Theory 


First described by Fant (1960), the source-filter theory of 
speech production forms the foundation of contemporary 


understanding in vocal tract acoustics. The central tenet of 
the theory is that acoustic energy produced by a sound 
source (the glottal spectrum) is propagated through a filter- 
ing system (the vocal tract), resulting in a modified output 
waveform (Fig. 7.5). The glottal spectrum, generated by the 
vibratory pattern of the vocal folds, is rich in harmonics that 
decline in energy with increasing frequency. When the har- 
monically rich glottal waveform is transmitted through the 
supralaryngeal cavities, individual frequency components 
are maximized or minimized. Frequency bands allowing 
maximum acoustic energy transfer are termed formants. 
Formants are analogous to the natural modes of vibration 
within the vocal tract and can be derived from knowledge of 
its physical dimensions. Formants may be described in terms 
of center frequency, bandwidth, and amplitude. 

The source-filter theory is based on two primary assump- 
tions: linearity and time invariance (Kent, 1993). Linearity 
assumes that the model responds equally to several individ- 
ual inputs as to the sum of those individual inputs. Time 
invariance states that the model responds indifferently to 
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Figure 7.5 Central principles of the source-filter theory. The glottal 
spectrum (A) is propagated through the vocal tract filter (B), resulting a 
modified output spectrum (C). (From Titze, 1. R. [1994a]. Principles of 
voice production. Englewood Cliffs, NJ: Prentice-Hall. [© 1994] Ingo R. 
Titze. Reprinted with permission.) 
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input that is either time-delayed or time-advanced. These 
assumptions do not hold under every circumstance; how- 
ever, they are fairly robust and allow the application and 
interpretation of several powerful analysis tools. The major- 
ity of acoustic analysis procedures described in this chapter 
will follow source-filter assumptions; however, an introduc- 
tion to nonlinear approaches will also be provided. 


Digital Signal Processing 


Contemporary methods in the acoustic analysis of voice are 
heavily reliant upon computer technology. Initially, however, 
the acoustic signal must be converted to a digital format that 
can be readily used by a computer. This process is termed 
analog-to-digital conversion (A/D conversion) and is a form of 
digital signal processing. A/D conversion comprises translat- 
ing a continuously varying analog signal to a discretely 
defined digital signal. Three essential stages are involved: fil- 
tering, sampling, and quantization. 

The filtering operations employed in A/D conversion 
consist of equalization of acoustic energy across the entire 
frequency range (preemphasis filtering), followed by elimi- 
nation of acoustic energy above the highest frequency of 
interest (presampling filtering) (Kent and Read, 2002; 
Owens, 1993). Sampling involves taking discrete measure- 
ments of signal energy at regular time intervals (Fig. 7.6). 
Signal information from between these sampling points is 
discarded; however, assuming an appropriate sampling rate 
(measured in hertz; Hz) has been selected, the digital wave- 
form can still be considered equivalent to its analog source. 
Quantization entails translating the amplitude values of the 
signal into discrete units (Fig. 7.7). The greater the number of 
quantization levels (measured in bit rate), the higher the res- 
olution of the digitized signal. 

Each step in the A/D conversion process plays a key role in 
the preservation of signal integrity for subsequent analysis. 
Equalization is useful when dealing with speech input to com- 
pensate for the characteristic irregularity of energy distribu- 
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Figure 7.6 Digital sampling of an analog signal. Discrete measure- 
ments of signal amplitude are taken at each sampling period. (From 
Owens, F. J. [1993]. Signal processing of speech. New York: McGraw- 
Hill. © 1993 McGraw-Hill [U.S., Canada, Mexico] and Palgrave Macmil- 
lan [rest of world]. Reprinted with permission.) 
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Figure 7.7 Quantization of an analog signal. An increased number of 
quantization levels results in increased resolution of the digitized signal. 
(From Kent, R. D., Read, C. [2002]. Acoustic analysis of speech [2nd 
ed.]. Albany: Delmar Learning. [© 2002.] Reprinted with permission of 
Delmar Learning, a division of Thomson Learning: www.thomson- 
rights.com.) 


tion across low and high frequencies. Selecting an appropriate 
presampling filter cutoff and sampling rate is critical to avoid 
signal misrepresentation in the form of aliasing. Aliasing refers 
to the presence of a false waveform, reconstructed at a lower 
frequency than the original signal, and occurs when less than 
two samples are taken from each waveform cycle (Fig. 7.8). 
Thus, a valid digital representation of a signal requires a mini- 
mum sampling rate at least twice the highest frequency of 
interest, with presampling filtering of all signal energy above 
the highest frequency of interest. This rule was first stated by 
Nyquist (1928), and later proved by Shannon (1949). 
Guidelines for the acquisition of acoustic voice signals 
suggest a minimum sampling rate of 20 kHz (20,000 samples 
per second) and a minimum quantization rate of 16 bits 
(65,536 discrete levels of signal amplitude) (Titze, 1995). 
With continual increases in computing power and data stor- 
age capacity, these values can be often exceeded with negli- 
gible sacrifices in processing speed and storage availability. 


Practical Guidelines 
Data Collection 


The use of appropriate technical and procedural protocols is 
important in ensuring the integrity of data collected for 
acoustic analysis (Titze, 1995). Acoustic recording is best 
performed using a professional-grade condenser micro- 
phone, maintained at a distance of 3 to 4 cm and angle of 45 
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Time 


Signal amplitude 


Signal amplitude 


to 90 degrees from the mouth. A head-mounted microphone 
is preferable for ensuring a constant mouth-to-microphone 
distance. Ambient noise and room reverberation should be 
minimized and a sound-treated room used when possible. 
Data should be either directly digitized to computer or 
stored using digital audio tape (DAT) technology. 

For the majority of voice applications, sustained vowels 
are the most appropriate tokens for acoustic voice analysis. 
Because of the influence of vowel type, fundamental fre- 
quency (Fo), and intensity on vocal perturbation (Gelfer, 
1995; Orlikoff and Baken, 1990; Orlikoff and Kahane, 1991), 
it is important to report these variables when conducting 
such analyses. Further, accurate and reliable perturbation 
extraction requires analysis across multiple tokens, each 
containing several hundred vibratory cycles (Karnell, 1991; 
Scherer, Vail, and Guo, 1995). 


Analysis Tools 


The acoustic waveform contains an enormous amount of 
potentially useful information. A central challenge in the 
acoustic analysis of voice involves defining and extracting 
those waveform features that most accurately reflect laryn- 
geal function. As stated, the majority of techniques are 
dependent upon the source filter theory, exploiting the lin- 
ear separability of the glottal source and vocal tract function. 
Though often complex and constantly being refined, many 
algorithms continue to be vulnerable to error, particularly in 
the analysis of severely dysphonic voices (Rabinov, Kreiman, 
Gerratt, et al., 1995). Nevertheless, when appropriately 
applied and carefully interpreted, acoustic measures hold 
significant value in describing vocal function. 

After a consensus meeting in 1994, The National Center for 
Voice and Speech (NCVS) issued a summary statement rec- 
ommending analysis selection based on signal typing (Titze, 
1995). Three signals types were differentiated in terms of 
periodicity and the presence of bifurcations. Signals that are 
nearly periodic with no bifurcations or strong modulating 
frequencies (denoted type 1 signals) may be appropriately 





Figure 7.8 Aliasing of a digitized signal due to an inadequate sam- 
pling rate (sampling below the Nyquist frequency). (From Owens, 
F. J. [1993]. Signal processing of speech. New York: McGraw-Hill. 
[© 1993] McGraw-Hill [U.S., Canada, Mexico] and Palgrave Macmil- 
lan [rest of world]. Reprinted with permission.) 
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analyzed using perturbation indices. Signals that contain 
bifurcations and/or strong modulating frequencies (denoted 
type 2 signals) may be best analyzed using visual displays 
that permit appreciation of qualitative changes within the 
analysis segment (e.g., nonlinear phase plots, spectrograms, 
Fo contours). Signals that are completely aperiodic (denoted 
type 3 signals) may be best analyzed using perceptual rat- 
ings. If adhered to, these consensus guidelines can help pro- 
tect against inappropriate analyses and maximize the utility 
of acoustic data. The following section outlines analysis tools 
commonly employed in the acoustic assessment of voice. 


Fundamental Frequency Fundamental frequency (Fo) cor- 
responds with the rate of oscillation of the vocal folds 
(expressed in Hz), and is a primary factor in the perception of 
pitch. The accurate determination of Fp by computer is 
extremely challenging and has been identified as a crucial 
problem in the acoustic analysis of voice (Read, Buder, and 
Kent, 1992). In an attempt to improve the accuracy of Fo cap- 
ture, scientists have approached the acoustic signal using a 
variety of methods (Fig. 7.9). Algorithms such as peak-picking 
and zero-crossing measure the interval between discrete sig- 
nal events (such as amplitude peaks or the upward crossing of 
a horizontal axis) to determine the fundamental period. Strate- 
gies such as autocorrelation match the waveform with a time- 
delayed copy of itself, assuming that the copy with the best 
match is delayed by the fundamental period. Cepstral analysis 
involves performing sequential Fourier transformations on the 
signal and measuring the quefrency (equivalent to the funda- 
mental period) at the point of greatest cepstral energy. 

Each of these approaches to Fp extraction is vulnerable to 
contamination by signal noise and periodicity, irregularities 
in waveform morphology, rapid changes in formant struc- 
ture, and the presence of subharmonics. These difficulties 
may be partially overcome by low-pass filtering the signal 
prior to analysis (Titze and Liang, 1993), in addition to appro- 
priate signal type selection (Titze, 1995). Comparison 
between Fo extraction methods appears to suggest that 
strategies based on waveform matching may be the most 
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reliable, at least when variations in Fo are less than 6% (Titze 
and Winholtz, 1993). Incorporating at least two extraction 
methods into an analysis and comparing the results across 
algorithms is another method of guarding against instru- 
ment failure (Read, Buder, and Kent, 1992). 

Fo is primarily modulated by changes in vocal fold length, 
mass, and tension. Variation in one or more of these param- 
eters can occur as the result of normal individual differences 
or the presence of pathology. Adult vocal folds are longer and 
have greater mass than those of children, resulting in a lower 
Fo; likewise, male vocal folds are longer and have greater 
mass than those of females, resulting in a lower Fp (Eckel, 
Koebke, Sittel, et al., 1999; Hirano, Kurita, and Nakashima, 
1983). Hormonal changes influencing vocal fold mass and 
tension may also contribute to changes in Fo in older adults 
(Hollien, 1987). Excessively high Fy may result from 
restricted vocal fold length and mass (e.g., due to a laryngeal 
web, or androgen deficiency in pubescent males) or elevated 
musculoskeletal tension and laryngeal posture. Excessively 
low Fo may result from increased vocal fold mass (e.g., due to 
vocal fold edema or polypoid degeneration or androgen 
exposure in females) or decreased tension (e.g., due to supe- 
rior laryngeal nerve neuropathy and impaired cricothyroid 
function). 

Fo may be measured using a sustained vowel token pro- 
duced at a comfortable pitch level, a specified target within a 
patient’s Fy range, or during a speaking or reading task. A 
sustained vowel production is useful in obtaining an Fo trace 
across time, which may reveal the presence of bifurcations 
or subharmonics, as well as long-term vocal instability such 
as tremor. Speaking and reading tasks (best performed using 
nonemotive material) allow an estimation of mean speaking 
Fo, as well as the standard deviation of speaking Fo (also 
called pitch sigma). Measurement of Fp range can highlight 
difficulties with vocal fold length and tension manipulation, 
neuromuscular control, or reduced tissue pliability. 


Zero-cross line 


Figure 7.9 Illustration of Fo extraction using zero- 
crossing and autocorrelation algorithms. In zero-crossing, 
the fundamental period is determined from adjacent 
upward crossings of the zero-cross line. In autocorrela- 
tion, the waveform is matched with time-delayed copies 
of itself. The waveform copy with the strongest correla- 
tion is delayed by the fundamental period. 


Typical speaking, Fp values vary as a function of age, sex, 
psychologic state, loudness, and speech task. Normative val- 
ues have been summarized by Baken and Orlikoff (2000). 
Prepubescent males and females have comparable Fo values 
of around 220 to 240 Hz, which differentiate to ~100 to 120 
Hz and 200 to 220 Hz in adult males and females, respec- 
tively. This sex difference generally narrows beyond the sev- 
enth decade, as speaking Fo increases in males and decreases 
slightly in females (Fig. 7.10). 
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Figure 7.10 Changes in speaking Fo across the life span, for trained and 
untrained voices. (From Titze, |. R. [1994a]. Principles of voice produc- 
tion. Englewood Cliffs, NJ: Prentice-Hall. [© 1994] Ingo R. Titze. Based on 
data reported in Brown, W. S., Morris, R. J., Hollien, H., Howell, E. [1991]. 
Speaking fundamental frequency characteristics as a function of age and 
professional singing. | Voice 5, 310-315. Reprinted with permission.) 
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Intensity Vocal intensity, the physical correlate of loud- 
ness, varies as a function of both subglottal pressure (Ps) and 
vocal fold vibratory amplitude. Increasing Ps during normal 
phonation will result in increased vibratory amplitude and 
mucosal wave excursion. Elevated intensity in a normal con- 
versational situation may reflect difficulty of self-monitoring 
due to a conductive hearing loss. Reduced intensity may be 
indicative of poor respiratory support, incomplete glottal 
closure, or reduced tissue pliability restricting vocal fold 
vibratory amplitude. In addition, reduced tissue pliability 
may contribute to difficulty achieving vocal fold oscillation 
at low Ps, limiting voice production at low intensities. 

Intensity is expressed using the decibel (dB) and can be 
simply measured using a sound-level meter. Alternatively, 
intensity can be measured using the amplitude of an appro- 
priately calibrated acoustic voice signal. As with Fo, intensity 
measurements can be made during a production of a sus- 
tained vowel token, during a speaking or reading task, or at 
intensity extremes (for the measurement of dynamic range). 
Vocal intensity tends to increase alongside increases in Fo, 
varies as a function of physical and psychologic state, and is 
typically 2 dB greater in both children compared with adults 
and in males compared with females (Coleman, Mabis, and 
Hinson, 1977; Susser and Bless, 1983). 


Voice Range Profile The voice range profile (VRP), or 
phonetogram, combines Fy and dynamic range data in a 
visual representation of the physiologic limits of the vocal 
system (Fig. 7.11). The VRP can be obtained using a keyboard 
and sound level meter, although automated computer pro- 
grams are also available to assist with data collection and 
presentation. Intensity is typically presented in dBs and Fo 
using either linear frequency (Hz), logarithmic semitones 
(ST), or percentage of total range. The VRP for a normal voice 
has a general form characterized by greater dynamic range in 
the midfrequency region and tapering near the frequency 
extremes. The contours for minimum and maximum inten- 
sity also tend to increase alongside increasing Fo. The VRP 
can be described both qualitatively and quantitatively. Varia- 
tions have been reported as a function of age, sex, history of 
vocal training, and the presence of vocal pathology (Speyer, 
Wieneke, van Wijck-Warnaar, et al., 2003; Sulter, Schutte, 
and Miller, 1995; Teles-Magalhaes, Pegoraro-Krook, and 
Pegoraro, 2000; Wuyts, Heylen, Mertens, et al., 2003). 
Methodological considerations have been outlined by Cole- 
man (1993). 


Spectrography and Spectral Measures Spectrography is a 
powerful analytic technique providing insight into both glot- 
tal source and vocal tract filter functions. The spectrogram is 
obtained by transformation of the acoustic waveform from 
the time domain to the frequency domain, specifically by the 
application of Fourier’s theorem. The resulting display pres- 
ents frequency on the vertical axis, time on the horizontal 
axis, and intensity as the darkness of the plot (Fig. 7.12). 
Spectrograms are extracted using a variable pass-band filter, 
which at a narrow-band setting (around 50 Hz) captures 
individual harmonics produced by the glottal source and ata 
wide-band setting (300 to 500 Hz) captures formants res- 
onating within the vocal tract (Fig. 7.13). Precise filter set- 
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Figure 7.11 Mean voice range profiles (VRPs) collected from a series 
of (A) males and (B) females. Dark lines represent untrained voices. 
Light lines represent trained voices. (Constructed based on data from 
Sulter, A. M., Schutte, H. K., Miller, D. G. [1995]. Differences in phone- 
togram features between male and female subjects with and without 
vocal training. | Voice 9, 363-377.) 


tings are mandated by the Fo of the source signal (e.g., a 
wide-band filter setting for an adult male may result in a 
narrow-band spectrogram when used with a child’s voice). 

In addition to providing harmonic and formant information, 
the spectrogram can provide qualitative insight into abnormal 
voice quality. Whereas a normal voice signal contains well- 
defined harmonics/formants, a dysphonic voice signal is often 
characterized by weak and irregular harmonic/formant pat- 
terns and high-frequency noise (Yanagihara, 1967). Bifurca- 
tions may also be observed in type 2 signals. 

Spectral information can also be extracted using fast 
Fourier transformation (FFT) or linear predictive coding 
(LPC) (Owens, 1993). These mathematical procedures allow 
the derivation of frequency and amplitude data for a short 
time window. A related measure, the long-term average 
spectrum (LTAS), provides spectral information across an 
extended window and can be used with reading and conver- 
sational speech samples (Fig. 7.14). Differences in peak 
energy region, as well as spectral shape and tilt, have been 
reported as a function of sex, age, and the presence of vocal 
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Figure 7.12 Acoustic signal and spectrogram of 
Tal i H we i ‘ul M paeem Tiana han the phrase “Kate cut the cake.” 
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Figure 7.15 The cepstral waveform. The magni- 
tude of the dominant rahmonic, situated at the 
fundamental period, has been associated with 
voice quality. Here, a quefrency of 8 milliseconds 
indicates an Fp of 125 Hz. 
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pathology (Kitzing and Akerlund, 1993; Linville, 2002; Men- 
doza, Valencia, Munoz, et al., 1996; White, 2001). 

Cepstral analysis, introduced earlier as a useful tool in the 
extraction of Fo, also holds application in the acoustic analy- 
sis of voice quality. The dominant cepstral rahmonic, situ- 
ated at the fundamental period, is an indicator of the 
strength of the rahmonic component of the input signal 
(Fig. 7.15). Several studies have reported an association 
between dominant harmonic magnitude and perceptual rat- 
ings of voice quality (Dejonckere, 1998; Dejonckere and 
Lebacq, 1996; Hillenbrand, Cleveland, and Erickson, 1994; 
Hillenbrand and Houde, 1996). 


Perturbation Measures Perturbation refers to the pres- 
ence of cycle-to-cycle variation in Fp, amplitude, and wave- 
form morphology. Jitter refers to cycle-to-cycle variation in 
signal frequency (Fig. 7.16). Shimmer refers to cycle-to-cycle 
variation in signal amplitude. Indices such as signal-to-noise 
ratio (SNR; Klingholz, 1987) and harmonics-to-noise ratio 
(HNR; Yumoto, Gould, and Baer, 1982) are more general 
measures of perturbation across Fp, amplitude, and wave- 
form morphology. In normal voices, some degree of cycle-to- 
cycle perturbation is expected; in contrast, excessive levels 
of perturbation have been associated with the presence of 
vocal pathology (Laver, Hiller, and Mackenzie Beck, 1992). 
Physiologically, the presence of perturbation in the voice 
signal is indicative of some degree of irregularity in vocal 
fold vibration. Several specific contributing mechanisms 
have been hypothesized. These include momentary fluctua- 
tions in neuromuscular activity (Orlikoff, 1989), subtle 
asymmetries in vocal fold shape and stiffness, aerodynamic 


Acoustic signal 


turbulence (Titze, 1994a), and systolic pressure shifts within 
vocal fold blood vessels (Orlikoff, 1990a,b). Although these 
factors likely contribute to the small degree of waveform 
aperiodicity observed in normal voices, more severe pertur- 
bations are almost always secondary to pathology. 

Whereas the existence of perturbation in the vocal signal 
is universally agreed upon, no consensus has been reached 
regarding its precise mathematical definition. As stated by 
Titze (1994a), there are many ways to approach the quantifi- 
cation of pattern deviation. It is perhaps for this reason that 
so many perturbation algorithms have emerged. Absolute 
measures reflect absolute differences in duration or ampli- 
tude across sequential vocal cycles and are easily converted 
into percentage values for the entire analysis window. Rela- 
tive measures (e.g., relative average perturbation, frequency 
perturbation quotient, amplitude perturbation quotient) 
have been developed to account for long-term trends in Fo or 
amplitude, often by deriving a second-order perturbation 
function. SNR and HNR indices calculate the relative pres- 
ence of harmonic and noise energy within a signal. The NCVS 
consensus guidelines for perturbation measurement recom- 
mend using mean absolute measures by default and second- 
order measures only when faced with a clear long-term 
trend (Titze, 1995). Signal typing is always necessary to 
determine the appropriateness of any perturbation measure. 


Nonlinear Measures Nonlinear approaches to the acoustic 
analysis of voice are centered on the fact that the complex 
and often unpredictable nature of vocal fold vibration is not 
adequately accounted for by linear assumptions (such as 
which underlie the source filter theory of speech production). 





Fo trace 


Figure 7.16 Fo trace of a sus- 
tained vowel produced by a 
normal speaker, illustrating 
subtle irregularities across time. 
These cycle-to-cycle irregulari- 
ties reflect frequency perturba- 


Mean Fg = 126.3 Hz tion, or jitter. 
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Figure 7.17 Phase spaces calculated from acoustic signals collected 
from a single patient (A) before and (B) after phonosurgery for the 
removal of vocal fold polyps. The postsurgical plot reveals greater tra- 
jectory stability than the presurgical plot. (From Zhang, Y., McGilligan, 
C., Zhou, L., Vig, M., Jiang, J. J. [2004]. Nonlinear dynamic analysis of 
voices before and after surgical excision of vocal polyps. J Acoust Soc 
Am 115[5 Pt 1], 2270-2277. © 2004 Acoustical Society of America. 
Reprinted with permission.) 


These analysis approaches are rooted in nonlinear dynamic 
theory, which views the output of certain complex systems 
as not random but characterized by inherent system nonlin- 
earities. Such systems are termed chaotic and by definition 
must be deterministic (governed by initial conditions and 
rules), nonlinear, unpredictable, low-dimensional (con- 
trolled by relatively few parameters), and highly sensitive to 
initial conditions (whereby small changes grow exponen- 
tially with time) (Baken, 2003). 

A popular method for displaying the output of a nonlinear 
dynamic system is to plot its trajectory in a phase space (Fig. 
7.17). A phase space represents the independent variables of 
a system as dimensions in the space. Any variable of interest 
can be examined (e.g, mass, displacement, velocity, time, 
and so forth); however, in the case of an acoustic signal rep- 
resented in the time domain, it is common practice to convert 
the signal into phase space using a time delay coordinate for 
each dimension (Herzel, Berry, Titze, et al., 1994). As a signal 
is represented in phase space over time, it may shift from an 


unstable toward a stable or asymptotic trajectory. This stable 
trajectory is termed an attractor. Herzel, Berry, Titze, et al., 
(1994) described four attractor types: stable stationary 
states, limit cycles (characterized by a limiting trajectory), 
tori (representing multiple superimposed oscillators at inde- 
pendent frequencies), and chaotic (nonperiodic). 

Several investigators have demonstrated the applicabil- 
ity of nonlinear dynamic analysis to the clinically disor- 
dered voice (Giovanni, Ouaknine, and Triglia, 1999; Herzel, 
Berry, Titze, et al., 1994; Herzel, Berry, Titze, et al., 1995; 
Zhang, McGilligan, Zhou, et al., 2004). These studies have 
used phase portraits and other qualitative methods for 
identifying system attractors, in addition to a range of 
quantitative measures, such as Lyapunov exponents (which 
reflect the rate of divergence [or convergence] of initially 
neighboring trajectories over time), dimension measures, 
and various entropy measures (which reflect the degree of 
disorder and rate of dynamic system information loss over 
time). 


+ Aerodynamic Analysis 


Aerodynamic analysis of voice involves measuring changes 
in air volume, flow, and pressure during phonation. Variation 
in these parameters allows insight into both respiratory and 
laryngeal performance. In many cases, a variety of measure- 
ment and analysis approaches are available to the clinician 
or scientist. These approaches vary considerably in terms of 
the need for specialized equipment, as well as the degree of 
invasiveness imposed on the patient. 

As with acoustic analysis, aerodynamic studies are often 
dependent upon digital signal processing techniques and, in 
the case of inverse filtering of the flow waveform, source-fil- 
ter theory. Contemporary data acquisition systems employ 
flow and pressure transducers, which produce analog voltage 
signals for A/D conversion. These signals, as with acoustic 
signals captured using a microphone transducer, are vulner- 
able to aliasing in the hands of an inexperienced user. These 
signals must also be appropriately calibrated using a stan- 
dard instrument (e.g., a U-tube manometer for pressure, a 
flowmeter or rotameter for flow). 


Volume 


Volume refers to the three-dimensional amount of air either 
present or displaced within a system. Common air-volume 
measurements include total lung volume, vital capacity 
(maximum exhaled volume after maximum inhalation), 
tidal volume (mean volume transported during resting 
breathing), inspiratory reserve volume (maximum forced 
inspiratory volume immediately after a resting inspiratory 
cycle), expiratory reserve volume (maximum forced expira- 
tory volume immediately after a resting inspiratory cycle), 
and residual volume (volume remaining after forceful 
exhalation). Whereas these measures provide useful infor- 
mation concerning the integrity of the respiratory system, 
it is important to note that the air volume required to 
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sustain voice and speech for typical communication is con- 
siderably less than that required to sustain life. With the 
exception of singing, which may be initiated using close to 
100% of vital capacity (Watson and Hixon, 1985), most voic- 
ing tasks do not tax the volumetric limits of an intact respi- 
ratory system. 

Pulmonary function values vary systematically with age, 
sex, and height (Baldwin, Courand, and Richards, 1948), as 
well as the presence of obstructive or restrictive pulmonary 
disease. Vital capacity is typically greater in males compared 
with females, adults compared with children, tall individuals 
compared with short individuals, and young adults com- 
pared with older adults. Racial-ethnic differences have also 
been reported (Golshan, Nematbakhsh, Amra, et al., 2003; 
Zheng and Zhong, 2002). Several researchers have offered 
predictive equations for the estimation of pulmonary func- 
tion values, traditionally using linear regression but more 
recently using polynomial regression functions (Glindmeyer, 
Lefante, McColloster, et al., 1995). 

Volume data can be collected using a variety of direct and 
indirect techniques. Wet spirometry relies on the air-volume 
displacement of water within a closed system; however, it is 
not always sensitive to the fast-moving volume displace- 
ments associated with speech. Alternatively, volume can be 
derived from mean flow data collected using a pneumota- 
chograph. Other methods rely on the relationship between 
volume displacement and movements of the chest wall (typ- 
ically measured at the ribcage and abdomen), which have 
been measured using magnetometers (Hixon, Goldman, and 
Mead, 1973; Hixon, Mead, and Goldman, 1976) and induc- 
tance plethysmographs (Russell and Stathopoulos, 1988). 
Although providing valuable information on the relative 
contribution of the ribcage and abdomen to volume changes 
during speech breathing events, these tools are mostly 
employed in research settings. 


Phonation Volume 


The maximum volume of air exhaled on a sustained phona- 
tory task (after maximum inhalation) is called phonation 
volume (PV; Yangihara and von Leden, 1967). PV is the prod- 
uct of maximum phonation time (MPT) and flow (both dis- 
cussed below) and is directly related to vital capacity. 
Because of this, PV is influenced by the same factors that 
influence vital capacity, namely age, sex, and height. 


Maximum Phonation Time and the S/Z Ratio 


Two traditionally popular indirect clinical measures of respi- 
ratory integrity and laryngeal valving efficiency are MPT and 
the s/z ratio. MPT reflects the maximum duration of a sus- 
tained vowel after maximum inhalation and is typically 
averaged across multiple trials. As inferred above, MPT is a 
direct function of PV and flow and is also influenced by vital 
capacity. Thus, any variation in MPT may be a function of 
altered PV (reflecting respiratory support for phonation) 
and/or altered flow (reflecting glottal efficiency). In other 
words, MPT does not allow the differentiation of respiratory 
versus laryngeal valving issues. MPT is influenced by the 
same variables that influence PV and vital capacity. 


The s/z ratio (Boone, 1977) compares the maximum dura- 
tion of sustained /s/ and /z/ fricatives after maximum inhala- 
tion and across multiple trials. The principle underlying this 
measure is an assumption that maximum glottal efficiency 
will result in equal duration for both the /s/ and /z/ fricatives, 
yielding a theoretical ratio of 1.0. Any reduction in glottal 
efficiency should result in decreased /z/ duration relative to 
/s/, resulting in a ratio greater than 1.0. Several studies have 
reported s/z ratios below 1.0 in the normal population 
(Sorenson and Parker, 1992; Tait, Michel, and Carpenter, 
1980), presumably due to an additional flow impedance 
source during /z/ production (glottal and lingual) compared 
with /s/ (lingual only). In an early study of the clinical appli- 
cability of this tool, Eckel and Boone (1981) reported ratios in 
excess of 1.4 for 95% of individuals with documented vocal 
fold lesions. Subsequent research has failed to replicate this 
finding (Hufnagle and Hufnagel, 1988; Rastatter and Hyman, 
1982; Sorenson and Parker, 1992), lending question to the 
sensitivity of this measure. 

As with any maximum performance task, MPT and the s/z 
ratio are susceptible to variability both within and across 
individuals. This variability may be controlled to some degree 
by detailed instruction, coaching, and repeated trials. Kent, 
Kent, and Rosenbek (1987), in summarizing the literature on 
this topic, concluded that it may take as many as 10 trials to 
obtain a true MPT value. It is important to remember that this 
may not be clinically practical and may even introduce 
fatigue. Other (less easily controlled) sources of variability 
in these indices may include the use of novel glottal valving 
and lingual articulation strategies in an attempt to sustain 
constant phonatory/fricative output while compensating 
for decreasing lung volumes (Christensen, Fletcher, and 
McCutcheon, 1992; Solomon, Garlitz, and Milbrath, 2000). 


Flow 


Flow is a measure of volume velocity, or air volume dis- 
placed as a function of time. Mean flow (also referred to as 
DC flow) can be simply derived from volume and time for a 
given phonatory task. Alternatively, mean flow can be meas- 
ured using a pneumotachograph (Rothenberg, 1973, 1977) or 
hot wire anemometer (Kitajima, 1985; Woo, 1986; Woo, 
Colton, and Shangold, 1987) placed at the mouth, based on 
the assumption that an open vocal tract (during vowel pro- 
duction) has a negligible influence on flow. Such measure- 
ment of mean flow can provide insight into laryngeal valving 
efficiency. High flow rates may reflect an incomplete closure 
pattern and/or short closed phase, whereas low flow rates 
may reflect a pressed phonatory pattern and long closed 
phase. Normative data for mean flow are highly variable 
(Hirano, 1981; Wilson and Starr, 1985), possibly reflecting a 
wide range of laryngeal valving strategies across individuals. 
Because of this, mean flow is not a sensitive measure in iso- 
lation. Several derivative aerodynamic indices, of which 
mean flow is a component, are discussed below. 


Inverse Filtering 


Rothenberg (1973) introduced a technique whereby the 
flow signal captured at the mouth is processed to remove 
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the filtering influence of the vocal tract, yielding an estimate 
of the actual glottal flow waveform. This process, termed 
inverse filtering, can also be applied to the radiated acoustic 
signal (Miller and Matthews, 1963). Inverse filtering is inher- 
ently appealing, as it allows the noninvasive estimation of a 
glottal phenomenon; however, it also holds limitations. Most 
critically, eliminating the influence of the vocal tract requires 
anticipating its resonating properties. This is often difficult 
to achieve with complete precision, and so the inverse filters 
are often fine-tuned to obtain an idealized glottal flow wave- 
form. Although potentially useful with individuals with nor- 
mal voices, this methodology is susceptible to bias and error 
when analyzing pathologic voices with deviant glottal wave- 
forms. 

The inverse filtered flow waveform, or flow glottogram, 
can be characterized in several ways (Hillman, Holmberg, 
Perkell, et al., 1989; Holmberg, Hillman, and Perkell, 1988; 
Perkell, Hillman, and Holmberg, 1994). Major time-based 
measures include the fundamental period, open quotient 
(ratio of total glottal open time to the period), speed quotient 
(ratio of opening to closing time), and closing quotient (ratio 
of closing time to the period) (Fig. 7.18). Amplitude-based 
measures include peak flow (maximum flow, presumably at 
the point of greatest glottal opening during the vibratory 
cycle), minimum flow (flow during the closed phase of the 
vibratory cycle, indicative of glottal leakage), and AC flow 
(the difference between minimum and maximum flow) (Fig. 
7.19). In making these measurements, it is important to note 
that the glottal flow waveform is similar but distinct from 
the glottal area and impedance functions, primarily due to 
the influence of inertia on the accelerating and decelerating 
air stream during each vibratory cycle (Rothenberg, 1973). 
Because of this, measures derived from the glottal flow 
waveform will not directly correspond with those derived 
using high-speed imaging or electroglottography (EGG). 
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Figure 7.18 The inverse filtered flow waveform, or flow glottogram. 
The waveform can be delineated into open (t4 + t2), closed (t3), open- 
ing (tı), and closing (t2) phases. Time-based measures include the fun- 
damental period (T), open quotient ([t; + t2]/T), speed quotient (t,/t2), 
and closing quotient (t2/7). (From Hillman, R. E., Holmberg, E. B., 
Perkell, J. S., Walsh, M., Vaughan, C. [1989]. Objective assessment of 
vocal hyperfunction: an experimental framework and initial results. J 
Speech Hear Res 32, 373-392. [© 1989] American Speech-Language- 
Hearing Association. Reprinted with permission.) 
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Figure 7.19 The inverse filtered flow waveform and its first derivative. 
Amplitude-based measures include AC flow (7), minimum flow (2), and 
peak flow (3). Maximum flow declination rate (MFDR) (4) is extracted 
from the first derivative. (From Hillman, R. E., Holmberg, E. B., Perkell, 
J. S., Walsh, M., Vaughan, C. [1989]. Objective assessment of vocal 
hyperfunction: an experimental framework and initial results. | Speech 
Hear Res 32, 373-392. [© 1989] American Speech-Language-Hearing 
Association. Reprinted with permission.) 


Additional derivative flow measures include the AC/DC 
ratio (Isshiki, 1981) and maximum flow declination rate 
(MFDR) (Holmberg, Hillman, and Perkell, 1988). The AC/DC 
ratio, reflecting the difference between modulated AC flow 
(the alternating portion of the flow signal induced by vocal 
fold vibration) and unmodulated DC (mean) flow, is consid- 
ered a pseudo-efficiency index that does not rely on the 
measurement of Ps or acoustic output. The AC/DC ratio 
reflects the efficiency of the vocal folds in converting DC to 
AC flow energy; thus, any leakage at the glottis during 
phonation should result in a relative increase in DC flow and 
corresponding drop in the AC/DC ratio. Isshiki (1981) 
reported an AC/DC ratio of 0.5 or greater for normal voices; 
however, other reports have questioned this criterion due to 
considerable variability across individuals (Holmberg, Hill- 
man, and Perkell, 1988; Wilson and Starr, 1985). 

Holmberg, Hillman, and Perkell (1988) introduced the 
MFDR as an estimate of maximum vocal fold velocity during 
the closing phase of the vibratory cycle. This index is derived 
by differentiating the glottal flow waveform and measuring 
the magnitude of the dominant negative peak (Fig. 7.19). 
Alongside speed and closing quotients measured directly 
from the glottal flow waveform, MFDR is theoretically asso- 
ciated with the magnitude of vocal fold collision forces, 
which when elevated may indicate susceptibility to vocal 
fold trauma. MFDR has been correlated with vocal intensity 
(Holmberg, Hillman, and Perkell, 1988) and appears to be 
elevated in the presence of certain hyperfunctional dyspho- 
nias (Hillman, Holmberg, Perkell, et al., 1989). It is also 
greater in men compared with women (Holmberg, Hillman, 
and Perkell, 1988) and in adults compared with children 
(Sapienza and Stathopoulos, 1994). 
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Pressure 


Pressure refers to force exerted against the unit area of a 
given surface. In the establishment and maintenance of vocal 
fold oscillation, the pressure exerted against the inferior sur- 
face of the vocal folds (subglottal pressure, P,) is of primary 
interest. P, can be measured using several approaches, both 
direct and indirect. Direct measurement approaches are 
highly invasive and involve either the placement of minia- 
ture pressure transducers above and below the glottis 
(Kitzing and Lofqvist, 1975; Lofqvist, Carlborg, et al., 1982) or 
tracheal puncture using a hypodermic needle coupled to a 
pressure transducer (Bard, Slavit, McCaffrey, et al, 1992; 
Plant and Hillel, 1998). Indirect approaches include place- 
ment of an esophageal balloon to measure pressure changes 
due to displacement of the tracheal-esophageal wall (Lieber- 
man, 1968) or the estimation of P, from intraoral pressure 
(Pio) during specific speech tasks (Smitheran and Hixon, 
1981). 

The indirect P, estimation technique described by 
Smitheran and Hixon (1981) has gained popularity in both 
clinical and research applications. This technique requires 
the patient to produce a train of /pi/ syllables at a constant Fo, 
intensity and rate, and is based on the assumption that at the 
moment of bilabial closure and vocal fold abduction for plo- 
sive production, the vocal tract is a closed system with equal 
pressure throughout its length. Measurement is made by 
averaging adjacent pressure peaks in the syllable train 
7.20). Several reports indicate high correlations between this 
indirect technique and other direct measurements of Ps 
(Hertegard, Gauffin, and Lindestad, 1995; Lofqvist, Carlborg, 
and Kitzing, 1982). 

Phonation threshold pressure (Pin), the minimum Ps 
required to initiate vocal fold oscillation, is an index with 
potentially high clinical relevance. P,, is theoretically asso- 
ciated with changes in vocal fold tissue viscosity (Titze, 
1988) and should therefore be elevated in disorders charac- 
terized by reduced tissue pliability, such as vocal fold scar- 
ring and sulcus vocalis. P,, appears to vary as a function of 
hydration status and corresponds with self-perceived rat- 
ings of vocal effort (Fisher, Ligon, Sobecks, et al., 2001; 
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Figure 7.20 Intraoral pressure (P,,) waveform for a train of four /pi/ syl- 
lables. Adjacent pressure peaks are averaged to obtain pressure values. 


Sivasankar and Fisher, 2003; Verdolini, Titze, and Fennell, 
1994; Verdolini-Marston, Titze, and Druker, 1990). A rela- 
tionship between P,, and vocal fatigue has also been 
reported (Solomon and DiMattia, 2000; Solomon, Glaze, 
Arnold, et al., 2003). 

Several derivative aerodynamic indices rely on the esti- 
mation of P,. Laryngeal airway resistance (Raw) is an estimate 
of glottal resistance to flow and is calculated by dividing Ps 
by mean flow (Smitheran and Hixon, 1981). High Riaw may 
be seen in cases of reduced tissue pliability or hyperfunc- 
tion; low Rjay may be seen in cases of glottal incompetence 
or incomplete vibratory closure. Normative data indicate 
that Rjay varies alongside intensity and is typically greater 
in smaller compared with larger larynges (i.e., in women 
compared with men, children compared with adults) 
(Holmes, Leeper, and Nicholson, 1994; Netsell, Lotz, Peters, 
et al., 1994). Studies addressing age-related changes have 
reported lower Rjaw values in older males but not in older 
females (Hoit and Hixon, 1992; Melcon, Hoit, and Hixon, 
1989). 

Vocal efficiency refers to the efficiency of the laryngeal 
apparatus in converting aerodynamic to acoustic energy. 
This index is most often calculated by dividing radiated 
vocal intensity (acoustic power) by the product of Ps and 
mean flow (aerodynamic power). Natural power losses may 
occur during this conversion process due to vocal fold tis- 
sue viscosity, turbulent flow at the glottal exit, and the 
absorption of acoustic energy by vocal tract tissues (Titze, 
1994a). The presence of vocal pathology may elevate power 
loss and reduce efficiency to an even greater extent (Jiang, 
Stern, Chen, et al., 2004). Vocal efficiency increases along- 
side increases in Fp and intensity, requiring careful control 
of these variables when making comparisons (Titze, 
1994a). 


¢ Electroglottography 


EGG, also called electrolaryngography, is an electrical 
impedance-based technology for inferring vocal fold contact 
changes during phonation. The central tenet of this proce- 
dure is that electrical impedance across the neck varies sys- 
tematically with changes in the degree of vocal fold contact. 
Complete vocal fold contact is associated with low imped- 
ance and direct current flow across the glottis. As vocal fold 
contact decreases, the high impedance of air effects a redi- 
rection of current flow around the open glottis, resulting in a 
reduction in effective voltage across the neck. These voltage 
changes form the basis of the vocal fold contact (or EGG) 
signal. 

EGG is performed using two electrodes placed on the thy- 
roid laminae. Electrical current passes between the elec- 
trodes in both directions, and the neck effectively acts as a 
variable resistor in a constant current circuit (Orlikoff, 1998). 
It is important to note that the EGG signal is influenced by a 
variety of impedance factors besides that of air at the open 
glottis. Extrinsic laryngeal muscle contraction, laryngeal pos- 
tural adjustments, and blood vessel dilation and constriction 
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Vocal fold contact area 


Figure 7.21 The electroglottographic signal. The signal can be delin- 
eated into contact (t, + t2), minimal contact (f3), increasing contact 
(tı), and decreasing contact (t2) phases. Time-based measures include 
the fundamental period (T), contact quotient ([t; + t2]/T) and contact 
index ([t2 — t]/[t + t2]). 


all contribute to gross neck impedance. These factors are typ- 
ically removed by high-pass filtering of the raw EGG signal. 
Other potentially challenging impedance factors include 
excess adipose tissue in the neck, which precludes the use of 
EGG in some patients, and the presence of mucus strands, 
which can provide a route for direct current flow across an 
open glottis, thus mimicking vocal fold contact. 

As noted, the EGG signal reflects changes in vocal fold con- 
tact area as a function of time (Fig. 7.21). It is important to 
stress that although the EGG signal corresponds with the 
vibratory cycle, it is clearly distinct from the flow glottogram 
obtained via inverse filtering and the glottal width and area 
functions obtained using high-speed imaging techniques. 
The EGG signal is only meaningful when the vocal folds have 
some degree of contact; it does not specify the region of 
vocal fold contact, and it cannot be used to accurately deter- 
mine the precise moment of initial contact or separation 
(Colton and Conture, 1990; Orlikoff, 1998). Further, the point 
of maximum contact in the EGG signal does not necessarily 
correspond with complete glottal closure. As a consequence 
of these physiologic realities, the EGG signal has been char- 
acterized in terms of a contact phase (marked by clearly 
increasing or decreasing vocal fold contact) and a minimal 
contact phase (featuring an apparent lack of vocal fold con- 
tact) (Orlikoff, 1998). Based on this delineation, several 
quantitative parameters have been developed to describe 
the EGG signal (Fig. 7.21). The contact quotient is defined as 
a ratio of the contact phase duration to the fundamental 
period (Orlikoff, 1991; Rothenberg and Mahshie, 1988; 
Scherer, Vail, and Rockwell, 1995). This quotient has a nor- 
mative range of 0.4 to 0.6 for males and females and is 
believed to indirectly reflect vocal fold adduction force. 
Another measure, the contact index, is defined as the differ- 
ence between increasing and decreasing vocal fold contact 
durations, divided by total contact phase duration (Orlikoff, 
1991). This measure reflects contact symmetry (a value of 
0 is associated with perfect symmetry) and is typically 
between —0.4 and —0.6 for males and females in modal reg- 
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Figure 7.22 Electroglottographic signals collected during voice pro- 
duction in modal, loft (falsetto), and pulse (vocal fry) registers. (From 
Baken, R. J., Orlikoff, R. F. (2000). Clinical measurement of speech and 
voice [2nd ed.]. San Diego: Singular Publishing Group. [© 2000]. 
Reprinted with permission of Delmar Learning, a division of Thomson 
Learning: www.thomsonrights.com.) 


ister. Pulse register phonation is associated with a contact 
index around —0.8; falsetto register phonation is associated 
with a contact index close to 0 (Orlikoff, 2003). 

Qualitative changes in EGG signal geometry have been 
associated with specific alterations in vocal fold dynamics. 
Characteristic patterns are seen with each of the vocal regis- 
ters (Fig. 7.22). Using modeling data, Titze (1990) described 
four dimensions of variation in the EGG signal accounted for 
by specific vocal fold adjustments (Fig. 7.23). Widening of 
the signal peak is associated with increased vocal fold 
adduction forces alongside an increased contact quotient. 
Skewing of the signal peak is a reflection of greater increas- 
ing contact compared with decreasing contact speed, which 
is a consequence of the natural convergent nature of the 
glottis combined with the vibratory phase difference bet- 
ween the upper and lower vocal fold margins. The presence 
of a bulge in the increasing and decreasing contact portions 
of the signal (termed skirt elevation) is associated with a 
physical bulge on the vocal fold medial surface. Finally, 
increased triangularity (termed skirt ramping) of the signal is 
observed with an increased vertical phase difference. 


¢ Laryngeal Visualization 


Visualization of the larynx during respiratory and phonatory 
activities is of enormous benefit in the diagnosis and treat- 
ment of voice disorders. By observing the laryngeal struc- 
tures under various phonatory and nonphonatory conditions, 
critical diagnostic, prognostic, and treatment-shaping data 
can be obtained. Technological advancements continue to 
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Figure 7.23 Qualitative variations in the electroglotto- 
graphic signal and associated vocal fold adjustments. 
(From Titze, I. R. [1990]. Interpretation of the elec- 
troglottographic signal. J Voice 4, 1-9. © 1990 The Voice 
Foundation. Reprinted with permission.) 
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improve endoscopic instrument quality as well as image 
capture, analysis, and storage capabilities. In addition, the 
appropriate interpretation of complex vibratory patterns is 
aided by a growing understanding of lamina propria ultra- 
structure and vocal fold physiology. 

Laryngeal visualization is within the scope of practice of 
both otolaryngologists and speech-language pathologists 
(American Speech-Language-Hearing Association, 1998). 
Otolaryngologists use the information gained from inter- 
preting laryngeal images when making a medical diagnosis 
and planning a medical or surgical intervention strategy. 
Speech-language pathologists use this information to assess 
vocal function, evaluate therapy probes, plan behavioral 
intervention, and administer biofeedback. In many voice- 
care settings, otolaryngologists and speech-language pathol- 
ogists work together to interpret all available case history 
and vocal function data and form a joint intervention strat- 
egy drawing on medical, surgical, and behavioral approaches 
as needed. 


Surface bulging 


Vertical phasing 


Indirect Laryngoscopy 


Indirect laryngoscopy refers to a series of clinical procedures 
designed to allow indirect visualization of the larynx via light 
reflected against a mirror or prism system or transmitted 
through a fiberoptic cable (Fig. 7.24). These procedures allow 
evaluation while a patient is awake and in many cases unanes- 
thetized. Although extremely useful in the assessment of 
laryngeal structure and function, it is important to note that 
observing the natural physiologic function of the larynx can be 
challenging in many cases due to individual oral-pharyngeal 
anatomy, certain hyperfunctional postures, and the gag reflex. 


Mirror Laryngoscopy 


Mirror laryngoscopy is a traditional technique allowing gross 
visualization of the larynx. In this procedure, the tongue is 
drawn forward while a small laryngeal mirror is placed in the 
oropharynx, positioned to allow visualization of the larynx. A 
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Figure 7.24 Instrumentation used for indirect laryngoscopy. (A) A 
laryngeal mirror, (B) 70-degree rigid endoscope, and (C) flexible endo- 
scope. 


continuous light source is used to reflect light from the mir- 
ror. The patient is generally instructed to produce /i/ at a high 
pitch during this examination. This phonatory posture, com- 
bined with holding of the tongue, functions to elevate the 
larynx and raise the epiglottis, thus allowing maximum visual- 
ization of the vocal folds. Although useful for rapid and gross 
examination, standard mirror laryngoscopy does not facili- 
tate normal physiologic voice production or the evaluation of 
vocal fold vibratory parameters. A stroboscopic light source 
can be employed with this procedure; however, permanent 
image capture is difficult. Finally, the lack of magnification 
provided by the laryngeal mirror can mask the presence of 
subtle laryngeal abnormalities with this technique. 


Rigid Endoscopy 


Rigid endoscopy employs a rigid metal tube device (rigid 
endoscope) containing a lens and prism system designed to 
transmit high-intensity light from its proximal to its distal 
end and a highly magnified image from its distal to its prox- 
imal end. A fiberoptic cable provides light from a light 
source, which is transmitted through a channel of the endo- 
scope’s optical array to illuminate the viewing field. A prism 
providing either a 70-degree or 90-degree viewing angle 


facilitates image transmission through an alternate optical 
channel to the endoscope’s eyepiece. A photographic or 
videographic capture system is usually connected to the eye- 
piece, allowing permanent image documentation. 

For laryngeal visualization, the patient is either seated 
upright (if employing a 90-degree rigid endoscope) or with 
the upper body and head extended forward (if employing a 
70-degree rigid endoscope). As with mirror laryngoscopy, 
the tongue is drawn forward while the rigid endoscope is 
placed in the oral cavity and inserted as far as the orophar- 
ynx. Whereas certain patients require the application of top- 
ical anesthetic to the oropharynx, most are able to tolerate 
this procedure well if appropriately positioned and carefully 
examined. In a small number of cases, however, individual 
oral-pharyngeal anatomy may preclude the use of this tech- 
nique altogether. Rigid endoscopy has the advantage of pro- 
viding a high-quality magnified image (Fig. 7.25), adequate 
for the identification of subtle laryngeal anomalies. Further, 
the high-intensity light transmission capacity of the rigid 
endoscope allows the use of stroboscopic and high-speed 
imaging techniques. The primary disadvantage of this proce- 
dure, as with mirror laryngoscopy, is the interference of the 
endoscope with natural voice and speech production, 
restricting phonatory samples to isolated vowels. 


Flexible Endoscopy 


Flexible endoscopy involves the use of a thin fiberoptic cable 
device (flexible endoscope) containing an array of fibers 
arranged to transmit light from its proximal to its distal end 
and an image from its distal to its proximal end. As with the 
rigid endoscope, an attached fiberoptic cable connects the 
endoscope to a light source. A control lever allows upward- 
downward maneuvering of the endoscope tip. An eyepiece is 
available for direct observation or connection to an image 
capture system. 

Laryngeal visualization using a flexible endoscope is 
transnasal. This approach holds the benefit of bypassing the 
gag reflex and allowing evaluation of velopharyngeal struc- 
ture and function en route to the larynx. Topical anesthetic is 
generally used for this procedure. The endoscope is gradu- 
ally advanced through the nasal cavity, nasopharynx, and 
oropharynx to the laryngopharynx, where it is positioned to 
provide maximum exposure of the vocal folds or other laryn- 
geal feature of interest. 

Flexible endoscopy has the advantage of allowing the eval- 
uation of phonatory function in a natural posture during 
connected speech and singing. Further, the small diameter of 
many flexible endoscopes allows for the evaluation of infants 
and young children. Certain disorders such as functional dys- 
phonia, spasmodic dysphonia, and paradoxical vocal fold 
motion are best evaluated using this procedure (although 
often as an adjunct to rigid endoscopy). Stroboscopy can be 
performed in conjunction with a high-quality flexible endo- 
scope and stroboscopic light source. The primary disadvan- 
tages of flexible endoscopy are limited light transmission 
through the fiberoptic array (particularly with small- 
diameter endoscopes) and suboptimal image resolution. Opti- 
cal distortion is an additional consideration (Hibi, Bless, Hirano, 
et al., 1988). A significant recent development in flexible 


154 Clinical Voice Disorders 








Figure 7.25 Laryngeal still images captured using a 70-degree rigid 
endoscope. (A) The upper image shows the larynx during resting 
breathing, viewed using a continuous light source. (B) The middle 
image shows a posterior gap closure pattern on phonation, viewed 
using a stroboscopic light source. (C) The lower image illustrates the 
mucosal wave propagating across the superior surface of both vocal 
folds during phonation, also viewed using a stroboscopic light source. 


endoscopy technology is the availability of endoscopes with 
a miniature digital camera at the distal tip. This, alongside 
parallel advancements in fiberoptic technology, holds prom- 
ise for ongoing improvements in image quality. 


Direct Laryngoscopy 


Direct laryngoscopy involves direct visualization of the lar- 
ynx, requiring the patient to be placed in a supine position 
under general anesthetic. This procedure is performed by the 
otolaryngologist. A laryngoscope is inserted into the oral cav- 
ity and positioned to allow maximum exposure of the vocal 


folds. The laryngoscope is often suspended to allow the sur- 
geon the use of both hands. Direct laryngoscopy allows close 
examination of the laryngeal tissues, including palpation of 
the vocal folds (e.g., to evaluate tissue and lesion pliability) 
and manipulation of the arytenoid cartilages (e.g., to differen- 
tiate ankylosis from paralysis as a cause of arytenoid immo- 
bility). A microscope is commonly used for detailed visualiza- 
tion. A contact endoscope combined with methylene blue 
staining may also be employed to examine cellular, nucleic, 
and cytoplasmic features of the vocal fold mucosa at high 
magnification. If indicated, tissue biopsy may be performed. 

Direct laryngoscopy is the most invasive of all laryngeal 
visualization procedures and clearly the most expensive. As 
the patient is anesthetized, phonatory function cannot be 
evaluated. Direct laryngoscopy is indicated in cases where 
tissue changes observed on indirect examination warrant 
biopsy and in rare cases where indirect laryngoscopy is not 
possible. 


Parameters of Interest 


Laryngoscopy performed under continuous light allows the 
evaluation of structure and gross function. The clinician 
should examine the laryngeal structures for any signs of 
acute or chronic irritation or injury and for irregularity or 
asymmetry. Potentially important structural signs include 
edema, erythema, pachydermia, leukoplakia, erythroplakia, 
vascular ectasia and varices, hemorrhage, vocal fold medial 
edge irregularity and/or roughness, vocal fold length and/or 
vertical height differences, prolapse, limited airway patency, 
laryngeal tilt or rotation, asymmetric regions of fullness, sul- 
cus vocalis, and the presence of mass lesions. Gross func- 
tional parameters include degree and symmetry of arytenoid 
excursion during vocal fold adduction and abduction, diado- 
chokinetic ability, cough, stridor, paradoxical vocal fold 
motion, static or intention tremor, excessive mucus pooling, 
and supraglottic hyperfunction in the anteroposterior and 
mediolateral planes. 


Studying Vocal Fold Vibratory Patterns 
Videostroboscopy 


Videostroboscopy refers to the use of a stroboscopic (pulsed) 
light source for endoscopic visualization of vocal fold vibra- 
tory parameters in apparent slow motion. The essential prin- 
ciple of this technique is centered on the visual perceptual 
phenomenon of apparent motion, whereby successive still 
images presented at a rate greater than 5 Hz appear to be 
moving continuously. This concept (also the basis of regular 
videography) is specifically exploited in videostroboscopy by 
using a stroboscopic light to illuminate discrete moments of 
subsequent vocal fold vibratory cycles, which are then per- 
ceived as an apparent single vibratory cycle. The stroboscopic 
light source is synchronized with vocal fold vibratory Fo cap- 
tured using a microphone or electroglottographic transducer. 
The light source may be precisely synchronized with Fo to 
provide an apparent motionless image (as each strobe pulse 
illuminates an identical position in each vibratory cycle), or 
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quasi-synchronized (1 to 2 Hz above Fo) to provide apparent 
slow motion (as each strobe pulse illuminates successive 
phases of subsequent vibratory cycles). 

Videostroboscopy is a critical tool in any clinical voice set- 
ting. It facilitates the evaluation of subtle vocal fold vibratory 
parameters and provides a basis for making inferences con- 
cerning the integrity of the vocal fold body-cover relation- 
ship. Several reports have demonstrated the importance of 
videostroboscopy in reaching an accurate diagnosis, revising 
a previous diagnosis, and designing treatment (Casiano, 
Zaveri, and Lundy, 1992; Woo, Colton, Casper, et al., 1991). 
The primary limitation of this technique is its reliance on the 
accurate extraction of Fo for synchronization of the strobo- 
scopic light, rendering the examination of voices characterized 
by highly aperiodic vibratory patterns impossible. Secondary 
limitations include an inability to capture vibratory behav- 
iors during phonatory onset and offset, and the fact that the 
observed vibratory “cycles” are illusionary. 

Key vocal fold vibratory parameters observed under stro- 
boscopic light include glottal closure pattern, amplitude, 
mucosal wave, presence of nonvibrating segments, phase 
closure, phase symmetry, and regularity (Bless, Hirano, and 
Feder, 1987; Hirano and Bless, 1993; Poburka, 1999). Glottal 
closure pattern is observed during the closed phase of the 
vibratory cycle and generally described using the following 
labels: complete, incomplete, irregular, hourglass, posterior 
gap, anterior gap, spindle gap, or variable. An abnormal clo- 
sure pattern (i.e., not complete at normal pitch and loud- 
ness) can result from poor vocal fold adduction due to 
recurrent laryngeal nerve damage or cricoarytenoid anky- 
losis, obstruction due to a vocal fold medial edge lesion, 
laryngeal web, or granuloma, or reduced tissue pliability 
due to scarring or sulcus vocalis. It is important to note that 
a posterior gap closure pattern is considered normal in 
females (Biever and Bless, 1989). Vibratory amplitude 
refers to the maximum mediolateral excursion of the vocal 
folds and is typically around 30% of the total vocal fold 
width in most individuals at normal pitch and loudness. 
Each vocal fold should be rated independently for this 
parameter. Decreased amplitude is associated with a 
decrease in vocal fold length (e.g., in females and children 
compared with males; or in the case of a laryngeal web), 
increased vocal fold tension (e.g., associated with increased 
Fy or due to hyperfunction), reduced tissue pliability, 
reduced Ps and loudness, and increased vocal fold mass 
(e.g., due to chronic edema or a mass lesion). Opposite 
directional changes in these variables are associated with 
increased vibratory amplitude. 

The mucosal wave refers to the phenomenon of a traveling 
wave moving across the superior surface of the vocal folds in 
the medial to lateral direction. At normal pitch and loudness, 
mucosal wave excursion is typically 50% of the total vocal 
fold width. A reduction in mucosal wave excursion (or com- 
plete absence of mucosal wave) is seen in cases of reduced 
tissue pliability, such as with vocal fold scarring, sulcus 
vocalis, mass lesions, epithelial hyperplasia, and mucosal 
dryness. Increased Fo and falsetto register phonation are also 
associated with reduced mucosal wave excursion. Increased 
mucosal wave excursion is often seen in cases of polypoid 


degeneration, in children who have an immature vocal fold 
layer structure, and alongside increases in Ps and loudness. It 
is important to note that variations in mucosal wave presen- 
tation may be regional as a consequence of focal tissue 
changes. A regional absence of mucosal wave combined with 
the absence of vibratory amplitude (i.e., complete regional 
immobility) characterizes what is termed a nonvibratory seg- 
ment. This feature may be constant or isolated to a certain 
phonatory condition or posture and is typically observed in 
cases of markedly impaired tissue pliability. 

Phase closure reflects the relative durations of the open 
and closed phases of an observed vibratory cycle. At normal 
pitch and loudness, the open phase may account for ~40 to 
60% of the entire cycle. A vibratory cycle that is predomi- 
nately open phase is typically associated with a breathy 
voice quality due to glottal hypofunction. In contrast, a 
vibratory cycle that is predominately closed phase is often 
associated with a pressed voice quality due to glottal hyper- 
function. Variations in Ps, Fo, vocal effort level, and vocal reg- 
ister all affect phase closure. Measurements can be made by 
taking a ratio of actual time duration or the number of video 
frames for both open and closed phases. When measuring 
this parameter, it is generally helpful to sample multiple 
vibratory cycles. 

Phase symmetry is a measure of the degree to which the 
two vocal folds mirror each other during vibration. The pres- 
ence of asymmetry may result from differences in vocal fold 
mass, length, and/or tension, which in turn may result from 
a unilateral lesion or edema, unilateral neuropathy, or uni- 
lateral differences in tissue pliability or viscosity. In some 
cases, unilateral vocal fold tension differences may be func- 
tional. Any vibratory asymmetry should be described as 
either constant, intermittent, or specific to a particular 
phonatory condition or posture. 

Regularity, also referred to as periodicity, refers to the 
degree of consistency from one vibratory cycle to the next. 
Irregularity can result from impaired motor control at the 
respiratory or phonatory levels, in addition to vocal fold bio- 
mechanical asymmetries. A popular method for evaluating 
this parameter involves the exact synchronization of Fp and 
the stroboscopic light source. In this mode, regular vibration 
appears as an apparent static image, whereas irregular vibra- 
tion is characterized by cycle-to-cycle movement. 

A complete videostroboscopic examination should incor- 
porate a range of tasks, including phonation at normal pitch 
and loudness levels, at points throughout the pitch and 
dynamic range, during glissando production, both within 
and transitioning between different registers, and during 
inhalation phonation. These tasks are in addition to those 
used to assess gross laryngeal function under continuous 
light. If a flexible endoscope is employed, speech and singing 
samples should also be elicited. 


Videokymography 


Videokymography is a charge-coupled device-based video 
technology allowing high-speed single-line scanning of an 
image of interest. Each line captured by the scan camera is 
appended to the preceding line to provide a series of stacked 
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images across time. When applied to the larynx, videoky- 
mography allows the visualization of vocal fold vibratory 
behavior for a single mediolateral plane. Line scanning is 
performed at ~8 kHz, which is sufficient to capture actual 
vibratory behavior in real time. This allows the observation 
of aperiodic vibratory patterns that cannot be seen using 
videostroboscopy. 

“Commercial videokymography cameras function in two 
modes. A standard mode is employed for orientation of the 
endoscope and selection of the plane for videokymography. A 
videokymographic mode is then employed for high-speed sin- 
gle line scanning and image capture. In first generation sys- 
tems, standard and videokymographic modes are engaged 
sequentially and line scanning occurs using the uppermost line 
of the image observed in the standard mode. As a result, selec- 
tion of the midmembranous vocal fold as the plane of interest 
requires appreciable endoscope tilt. This favors the use of a 90° 
rather than a 70° rigid endoscope. In second generation sys- 
tems, images from both modes are viewed and recorded 
simultaneously and the scanning line is not restricted to the 
uppermost line of the image (Qiu and Schutte, 2006). 

As stated, the primary advantage of videokymography 
over videostroboscopy is its ability to capture actual vibra- 
tory cycles and therefore facilitate the study of aperiodic 
vocal fold vibratory patterns. Its primary limitations are its 
restriction to a single mediolateral plane (although a series 
of videokymograms can be captured in different planes), 
and the potential introduction of artifact by movement of 
the endoscope or larynx during image capture. To avoid 
this, the scanning line should be maintained on a steady 
mediolateral plane, perpendicular to the glottal axis (Svec, 
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Sram, and Schutte, 2001), and (in first generation systems) 
the measurement position checked regularly by switching 
between videokymographic and standard camera modes. 
Also, when collecting and presenting videokymographic 
data, it is important to specify the antero-posterior posi- 
tion and any rotational orientation of the scanning line 
with respect to the glottis; in addition to the pitch, loud- 
ness and register features of the phonatory sample of 
interest.” 

Videokymography allows the evaluation of many of the 
same vibratory features as videostroboscopy; however, these 
features are presented in a different form (Figs. 7.26 and 
7.27). The glottis is seen as a dark region in the center of the 
kymogram, varying with respect to changes in vibratory 
amplitude throughout the cycle. The open, closed, opening 
and closing phases are easily identified. During the closing 
phase, the vocal fold medial surface and lower margin can be 
observed adjacent to the contour of the upper margin. The 
mucosal wave presents as a thin band that extends beyond 
the end point of the opening phase, typically maintaining a 
similar trajectory. Abnormal phase closure, phase asymme- 
tries, and phase irregularities are particularly salient on 
videokymography. 


High-Speed Digital Imaging 


High-speed digital imaging is the most powerful technology 
currently available for the study of vocal fold vibratory 
behavior. Whereas videokymography is a form of high-speed 
imaging, complete high-speed systems allow visualization of 
the entire surface of both vocal folds during phonation 


Figure 7.26 Schematic of a videokymogram illustrating key 
anatomic and vibratory parameters. (From Svec, J. G., Schutte, 
H. K. [1996]. Videokymography: high-speed line scanning of 
vocal fold vibration. J Voice 10, 201-205. [© 1996] The Voice 
Foundation. Reprinted with permission.) 
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rather than a single line. The primary advantages of this 
technology are its lack of dependence on vibratory periodic- 
ity, its ability to capture subtle vibratory features that may 
not be evident in apparent motion, and its ability to capture 
vocal fold dynamics during phonatory onset and offset. Prior 
to the advent of high-speed digital technology in the 1980s 
(Hirose, 1988), high-speed laryngeal visualization relied on 
photographic technology first developed at Bell Laboratories 
in the late 1930s (Timcke, von Leden, and Moore, 1958, 1959; 
von Leden, Moore, and Timcke, 1960; Whitehead, Metz, and 
Whitehead, 1984). 

Most commercially available high-speed systems allow 
visualization of the entire larynx at a frame rate of ~2 kHz. 
This frame rate can be increased alongside a reduction in 
resolution or size of the imaging window. Most high-speed 
digital systems are monochrome; however, color cameras 
are also available. Adequate visualization of the vocal folds 
during vibration requires high light intensity and is best 
performed using a rigid endoscope. A computer with high 
data transfer rate video memory is also required. Depend- 
ing on video memory capacity and frame rate, most cur- 
rent systems allow 2 to 8 seconds of data collection at one 
time. 

High-speed digital imaging is a useful research instrument 
and is beginning to be used in clinical settings. Most of the 
current challenges associated with this technology (frame 
rate, resolution, color quality, need for high light intensity, 
video memory data transfer rate and capacity) are technical 
and should be resolved with time. The vocal fold vibratory 
parameters of interest in high-speed digital images are simi- 
lar to those observed using videostroboscopy, with the addi- 
tion of preoscillatory and damping behaviors seen during 
phonatory onset and offset. High-speed images are also well 
suited to quantitative image analysis. 


Analysis of Images 
Visual Perceptual Rating 


In the majority of instances, laryngeal images are inter- 
preted qualitatively using individual visual perceptual 
judgments. These judgments feature the same validity and 


Figure 7.27 Avideokymogram obtained during modal 
register phonation at comfortable pitch and loudness 
levels. (A) The dark horizontal line on the standard cam- 
era mode image represents the orientation of the scan- 
ning line used to generate (B) the videokymogram. 


Time 


reliability challenges associated with auditory perceptual 
ratings. Several key factors in maximizing rating validity, 
interjudge reliability, and intrajudge reliability are quality 
and duration of training and experience, endoscopic image 
quality, an appreciation for artifacts associated with image 
distortion, and the nature of the rating system employed. 
Poburka and Bless (1998) demonstrated the effectiveness 
of 4 to 5 hours of computer-aided multimedia training in 
improving the accuracy of videostroboscopic ratings per- 
formed by inexperienced judges. After training, interjudge 
reliability improved between the inexperienced judges 
and a group of experienced judges. In a subsequent study, 
Poburka (1999) developed and evaluated an alternative 
videostroboscopy rating system based on feedback from 
the experienced judges (Fig. 7.28). Although intuitively 
easier to use, interjudge reliability with this alternative 
rating system was similar to that measured using a tradi- 
tional rating system. Poburka noted that phase closure, 
phase symmetry, and phase regularity were the most chal- 
lenging features to rate, regardless of the system 
employed. In an effort to address artifacts associated with 
image distortion, Peppard and Bless (1990) demonstrated a 
transparency tracing procedure to control endoscope-lar- 
ynx distance and improve image consistency across multi- 
ple examinations. 


Quantitative Image Analysis 


Attempts to quantify the vocal fold vibratory pattern first 
began with high-speed photography (Timcke, von Leden, 
and Moore, 1958, 1959; von Leden, Moore, and Timcke, 
1960), and have continued with videokymography and 
high-speed digital imaging (Larsson, Hertegard, Lindestad, 
et al., 2000; Qiu, Schutte, Gu, et al., 2003). Vibratory pattern 
data have also been extracted from videostroboscopic 
images (Lee, Kim, Park, et al., 1998; Lee, Kim, Sung, et al, 
2001); however, such data should be interpreted judi- 
ciously because of their reliance on apparent motion. The 
cardinal parameters of interest in these analyses are 
changes in individual vocal fold position, as well as changes 
in glottal area over time. From these parameters, glottal 
width and area functions can be derived and various 
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Stroboscopy Evaluation Rating Form (SERF) 
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Figure 7.28 The Stroboscopy Evaluation Rating Form (SERF), developed for the visual perceptual rating of videostrobo- 
scopic parameters. (From Poburka, B. J. [1999]. A new stroboscopy rating form. | Voice 13, 403-413. [© 1999] The Voice 
Foundation. Reprinted with permission.) (Continued on page 158) 


time-based indices calculated. The most commonly used 
glottal parameters are the open, closed, opening, and clos- 
ing phases. From these, the open, closing, and speed quo- 
tients can be calculated. Recall that these terms are also 
employed when quantifying key features of the morpho- 
logically similar but theoretically distinct flow glottogram. 
Unfortunately, they (in particular the open quotient) are 


often inappropriately applied to the vocal fold contact 
(EGG) waveform. 

Advances in computing power and refinements in imaging 
analysis technology continue to offer improvement in the 
speed and accuracy with which key laryngeal and vocal fold 
vibratory parameters can be measured. Image analysis appli- 
cations include absolute dimensional calibration based on 
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laser triangulation (Rosen, Minhaj, Hinds, et al., 2003; Schu- 
berth, Hoppe, Dollinger, et al., 2002), compensation for 
artifact due to endoscopic image distortion (Deliyski and 
Petrushev, 2003; Rosen, Minhaj, Hinds, et al., 2003), classifi- 
cation of vocal fold lesion color and texture (Wittenberg, 
Kothe, Munzenmayer, et al., 2003), extraction of kymograms 
from high-speed digital images (Tigges, Wittenberg, Mergell, 
et al., 1999), and detection of the vocal fold edge contour 


and/or mucosal wave (Deliyski and Petrushev, 2003; Yan, 
Ahmad, Kunduk, et al., 2005). 

Figures 7.29 and 7.30 illustrate a series of analytic tech- 
niques developed by Yan, Ahmad, Kunduk, et al. (2005) to 
quantify the vocal fold vibratory pattern observed using 
high-speed digital imaging. An edge detection algorithm is 
employed to extract the glottal area function. From this, an 
analytic phase plot (termed Nyquist plot) is derived using a 
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Vocal fold edge detection Figure 7.29 Vocal fold edge detection, glottal area waveform 
extraction, instantaneous frequency measurement, and analytic 

phase plot construction, based on high-speed digital images. (From 
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Hilbert transform technique. The Nyquist plot represents a tions from the mean, as based on analysis of the glottal area 
projection of the glottal area function in the complex plane function. 

over time and provides a visually salient representation of 

vibratory behavior. The trace pattern within the plot repre- 

sents intracycle variation, and the degree of scatter repre- ¢ Summary 

sents intercycle variation. Characteristic pattern deviations 

have been associated with certain voice quality deviations 

and vocal pathologies. A second plot, termed the perturba- A complete voice evaluation spans a range of domains and 
tion plot, summarizes instantaneous Fy and amplitude devia- yields a clinical data set requiring careful integration and 
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Analytic phase plot ("Nyquist" plot) 





Figure 7.30 Characteristic phase and perturbation plot pat- 
tern deviations associated with phonatory and pathologic con- 
ditions, extracted from high-speed digital images. Phase plots: 
(a) normal; (b) normal (high pitch, normal loudness); (c) 
breathy; (d) stage whisper; (e) hyperfunction; (f) pressed; (g) 
vocal fold scarring (high pitch, normal loudness); (h) vocal fold 
stiffness; (i) vocal fold cyst; (j) after surgical removal of vocal fold 
cyst. Perturbation plots: (a) normal (normal pitch, normal loud- 
ness); (b) after surgical removal of vocal fold cyst; (c) vocal fold 
cyst; (d) vocal fold scarring. (From Yan, Y., Ahmad, K., Kunduk, 
M., Bless, D. M. [2005]. Analysis of vocal-fold vibrations from 
high-speed laryngeal images using a Hilbert transform based 
methodology. J Voice 19, 161-175. © The Voice Foundation. 
Reprinted with permission.) 
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interpretation. It is important to note that not all testing pro- 
cedures are necessarily required in every case. Test selection 
should be driven by the case history and symptoms 
described by the patient. Procedures should be efficient and 
avoid unnecessary replication. Within each setting, the clini- 
cian should be careful to collect data that are comparable 
from visit to visit and with data reported in the literature. 
Literature-sourced data should be interpreted critically. The 
clinician should hold a solid understanding of how a given 
test or assessment procedure evaluates laryngeal structure 
and function and its role in making a differential diagnosis or 
constructing a treatment strategy. It is important to be wary 
of cases in which the different elements of a patient’s assess- 
ment data set appear incongruous. Such variation in per- 
formance may reflect the nature of the specific disorder, a 
favorable or unfavorable compensatory strategy, or an 
assessment error. 

Certain elements of voice assessment were not addressed 
in detail here, either because they are given adequate atten- 


ig hie eae 3 ieee 


tion elsewhere in this book or because they provide supple- 
mentary data to those obtained by the speech-language 
pathologist. Psychologic interviewing is addressed in detail 
in a separate chapter. Excess musculoskeletal tension 
assessment is discussed within the context of voice ther- 
apy. Radiologic imaging studies are often ordered by the 
otolaryngologist when evaluating structural defects and 
when searching for a mass either in the laryngeal region or 
along the path of the recurrent or superior laryngeal 
nerves. Additionally, magnetic resonance imaging (MRI) 
has been used successfully to evaluate soft tissue displace- 
ment after vocal fold medialization surgery (Ford, Unger, 
Zundel, et al., 1995). Laryngeal electromyography (LEMG) 
holds an important role in the evaluation and characteriza- 
tion of vocal fold movement disorders, including paralysis 
and paresis, and other neuromuscular disorders such as 
myasthenia gravis. It is helpful in differentiating paralysis 
from ankylosis and in making a prognosis for nerve recov- 
ery after injury. 
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Chapter 8 


Psychogenic and Other Behavioral Voice Disorders 


Voice is more than a mechanical or acoustic phenomenon. 
It is a mirror of personality, a carrier of moods and emo- 
tions, a key to neurotic and psychotic tendencies. 


—Brodnitz 


When I began my tenure at the Mayo Clinic 45 years ago, 
I was overwhelmed and uncomprehending of the richness 
of clinical material. “You can do anything you want here,” 
Dr. Charles Mayo said to me on my intake interview, “so long 
as you remember that the patient comes first.” Then they 
started coming, the word must have gotten around. They 
came from every medical and dental specialty conceivable. 
Referrals came for speech disorders that I had never read or 
even heard about, undocumented speech, voice, and lan- 
guage signs and symptoms: pharyngeo-laryngeo oculo-lin- 
guo myoclonus, foreign dialect syndrome, focal dystonias, 
borborygmi. I began to specialize in identification of the 
dysarthrias in neurologic disease and proposed to Dr. Fred 
Darley that we study the perceptual characteristics of the 
dysarthrias. 

The second disorder that piqued my interest was voice 
disorders, particularly patients with dysphonias and apho- 
nias referred to me from otolaryngologists—patients who 
had normal laryngologic examinations and for whom no 
rational organic etiology was apparent; patients often 
labeled as having functional or nonorganic etiologies. Being 
puzzled by the term functional caused me to wonder how it 
was possible for a person to be speaking in a normal voice 
since early childhood and then, for the first time in adult- 
hood, develop a hoarse, breathy, or loss of voice entirely (as 
in whispering or muteness). I thought back to my under- 
graduate and graduate education and realized that the 
causes of nonorganic voice disorders were never addressed 
in any logical or convincing fashion. Vocal abuse and vocal 
hyperfunction were the only categories mentioned. 

Then a patient was referred to me who taught me that 
most nonorganic voice disorders did not occur sponta- 
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neously or from vocal abuse or faulty habit pattern. A 45- 
year-old nun, a nurse who worked on the ophthalmology 
floor of a hospital, had come to the clinic complaining of 
episodes of whispering. Her laryngologic examination failed 
to disclose any evidence of laryngeal pathology. 

Never having been confronted by someone with an apho- 
nia, I recorded and listened carefully to her voice, searching 
for words that would describe the complexity of her apho- 
nia. It was a mixture of breathiness and strained hoarse- 
ness, interspersed with squeaks and squeals. Then I asked 
myself, “What was I supposed to do for this patient and her 
referring physician? Why, if this patient had a normal 
laryngeal examination, was she whispering and speaking 
with a squeaky voice?” She was a quiet nun, not yelling at 
basketball games or arguing loudly with others. | began to 
search the medical library for articles on aphonia. I discov- 
ered that many had been written by otolaryngologists and 
psychiatrists stating that aphonia is often caused by psy- 
chologic conflict, that it could be a hysterical or conversion 
disorder. 

She complained of a pain in the region of her larynx that 
radiated to her ears and down to her chest. She said that she 
felt as if she had a lump in her throat, like a ball that was dif- 
ficult to swallow. At that point, I believed I had two options: 
to refer her to psychiatry or become the psychiatrist myself 
and investigate the psychogenic hypothesis. I was in a 
quandary. Did | have the ethical right to investigate this 
patient’s personal life? Would it be dangerous to her if I did? 
I had no prior personal training in psychodiagnostic inter- 
viewing or psychotherapy. However, in light of the fact that 
I had been told I could do anything, I wanted to help and was 
determined to pursue a psychodiagnostic investigation. 

At that time, the chairman of the Department of Psychiatry 
was Dr. Edward M. Litin. He was a tall, rangy man with a 
deep, resonant voice; he could probably have been cast in 
the role of God. I had met him at a reception and liked his 
directness. I phoned him and asked him if he could meet 
with me, to which he agreed. Where patients were con- 
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cerned, these were called “curbside consultations.” We met 
over a cup of coffee, and I told him about my patient who 
was still sitting in my office. I told him about my reservations 
concerning the ethics and safety of my attempting a probing 
psychodiagnostic examination. I can see him now; leaning 
back in his chair, his long legs stretched out in front of him, 
and with that sepulchral voice of his he told me, “Arnold, I 
have found that human beings are tougher than that; that 
each of us possesses a character armor that shields us from 
divulging painful information. And as for the ethics of inves- 
tigating emotional causes, I think it is reasonable to say that 
it would be unethical if you did not do so.” He continued, 
“Actually, patients usually feel much better after they have 
revealed or unburdened themselves of repressed conflicts. 
Such catharsis provides the patient with a sense of relief for 
having unburdened themselves. And they are better off for 
the experience.” 

I returned to my patient and asked her if any of these 
aphonic episodes were associated with stresses in her life or 
personal problems. Tearfully, she told me that her life as a nun 
had not been what she had expected. At her age, it was overly 
physically demanding. She was expected to work 80 hours per 
week, ministering to patients on the ophthalmology floor, and 
that she no longer had the energy to sustain this kind of work- 
load. But she could not bring herself to tell the mother supe- 
rior and ask for a reduced workload. It was just against her 
concept of her responsibilities as a nun and a nurse. As she 
cried and spoke, her voice gradually returned to normal. 

And that was my baptism of fire, a classic conversion 
disorder—a loss of motor or sensory function of nonorganic 
etiology as a solution to a conflict. Unconsciously, without a 
voice she was cast ina sick role, the symbolic function of the 
voice loss now being that she would be unable to divulge her 
true feelings to her superior. 

Now, I know that many speech pathologists have not been 
trained as | had, not in this technique, and are reluctant to 
embark upon such a venture with a patient. But this first les- 
son has been the mainstay of my practice as a specialist in 
voice disorders, and ever since I have made it a rule to 
assume that all nonorganic voice disorders are to be pre- 
sumed as having a psychogenic etiology and that they all 
need to be subjected to a psychodiagnostic interview. 


+ Voice as an Indicator of Normal 
Personality 


Personality encompasses those behaviors that distinguish 
each individual as being different from others (e.g., facial 
expression, gestures, posture, gait, intelligence, aggressive- 
ness, confidence, attitudes, feelings, sensitivity, and emo- 
tional reactivity). Although one might argue that how a 
person speaks—voice quality, pitch, loudness, stress patterns, 
rate, pause, articulation, vocabulary, syntax, and ideational 
content—qualifies as a trait of personality, attempts to 
demonstrate it have been confounded by several obstacles 
including individual differences in anatomy, imitation or 


learning of parental and sibling voice patterns, and semantic 
and methodologic impediments: 


unsatisfactory definition of personality 
questionable validity and reliability of personality tests 


+ 
+ 
¢ difficulty in defining voice variables 

variations in research methodology (e.g., selection of sub- 
jects as to age, sex, and ethnic background; tests of per- 
sonality and voice; parameters, methods, and units of 


measurement) 


Nevertheless, there has been no shortage of studies on the 
relationship between personality and voice. Early investiga- 
tors attempted to relate auditory perceptual judgments of 
voice to personality traits such as neurotic tendency, anxiety, 
introversion, and extraversion. They suggested associations 
between selected percepts and personality: (1) breathy voice 
with neurotic tendency, anxiety, low dominance, and high 
introversion; (2) harsh/metallic voices with high dominance 
and emotional instability; (3) nasal whine with emotional 
instability and low dominance; (4) high loudness and low 
pitch are found with high dominance (Mallory and Miller, 
1958; Moore, 1939); (5) hoarseness with reticence and self- 
consciousness; (6) increased speaking rate with competence 
and decreased rate with decreased competence and in- 
creased benevolence; (7) wide variations in pitch with bene- 
volence and decreased variations in pitch and average level 
were associated with less competence and benevolence 
(Brown, Strong, and Rencher, 1974; Williamson, 1945). 

At the time these studies were completed, less was known 
about psychophysical scaling techniques, observer bias, the 
physiologic processes underlying voice production, and the 
inherent difficulty in interpreting studies on personality and 
voice. Accordingly, it is not surprising these interesting 
observational associations between perceptual qualities and 
personality have not stood the test of time. This is due, at 
least in part, to the research methods employed. For exam- 
ple, several of the studies used actors to produce the voice 
samples. Subsequent studies showed that different actors 
may attempt to convey the same personality characteristics 
with widely different voice patterns. In other words, actors 
are somewhat individualistic in their acoustic preferences in 
coding personality and speech. Research attempting to relate 
specific voice features and personality characteristics has 
conflicting results implying there is not a one-to-one rela- 
tionship. Differences may relate to personal experiences, 
interpretation of specific dialogues, bias, and the influence of 
listeners’ personality on judgments. Nevertheless, it is clear 
that listeners make some judgments about personality char- 
acteristics from listening to voice. For example, children with 
dysphonia were rated negatively on social and personality 
factors by naive, young adult judges when compared with 
those without dysphonia (Ruscello, Lass, and Podbesek, 
1988). Specifically, based on voice recordings alone, dyspho- 
nic children were judged more negatively on dimensions 
of “dirty,” “cruel,” “bad,” “worthless,” “dishonest,” “sick,” 
“sad,” “unpleasant,” and “ugly” than were their nondyspho- 
nic peers (P < 0.01). Thus it appears that listeners have 
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psychological percepts relating voice and personality but 
those characteristics have not yet been clearly defined. Later 
in this chapter, we will discuss how personality characteris- 
tics may relate to specific voice disorders. 


+ Voice as an Indicator of Discrete 
Emotions 


The idea that vocal acoustics are imbued with cues to vocalize 
emotional state has a long history in human inquiry. For 
instance, Cicero and Aristotle suggested that each emotion is 
associated with a distinctive tone of voice—a view espoused by 
one of the leading contemporary theories of vocal expression. 
However, it was arguably Darwin who provided the first com- 
prehensive description of the sounds associated with emotion. 

Darwin adopted an explicitly comparative perspective, 
examining emotion-related vocal signals in a variety of 
species that included nonhuman primates, ruminants, 
domestic dogs and cats, and humans. In each case, his rich 
descriptions led him to conclude that both these and other 
affective expressions are veridical, meaning that there is a 
direct correspondence between particular signaler states 
and the communicative display produced. Thus, Darwin 
treated emotion-related signaling as an inherently honest 
indicator of internal state. 

However, he also made two important observations that 
suggest a different perspective. The first was that vocal sig- 
nals can induce emotional responses in listeners; for exam- 
ple, when courting males use calls that “charm or excite the 
female” during mating season. The second was then going on 
to say that these vocal signals become associated with the 
“anticipation of the strongest possible pleasure which ani- 
mals are capable of feeling.” “. .. the critical ideas are that 
signalers can use species-typical sounds to influence listener 
affect, and that individually distinctive aspects of these sig- 
nals come to elicit learned emotional response in recipients 
that hear them in association with affect-inducing events” 
(Bachorowski and Owren, 2003). 

Affective prosody is defined as the pitch, intonation range, 
tempo, pausing, emphasis/stress, and rhythm that is auto- 
matically and subconsciously injected into speech and signi- 
fies emotion of the speaker (Monnot, 2005). When words 
and affective prosody do not match, as when someone is 
depressed but professes to be well, healthy listeners attend 
to the tone of voice rather than the linguistic content and 
make social judgments based on that subconsciously 
processed information (Monnot, 2005). Affective prosodic 
function has been shown to be a dominant and lateralized 
function of the right hemisphere (Monnot, 2002; Ross, 1997, 
1998) that declines with age (Monnot, 2005) and that can be 
disrupted or destroyed by clinical conditions such as alco- 
holism, schizophrenia, or by right hemisphere cortical 
lesions (Monnot, 2002; Ross, 1997; Testa, 2001). 

Vocal communication, or affective prosody, includes many 
elements of emotion such as happiness, joy, confidence, 
anger, grief, and boredom. Although social emotions are spe- 
cific to the culture in which they are learned, emotions have 


been described as short-lived psychologic-physiologic phe- 
nomena that represent “honest signals” and are thought to 
be critical to human communication. In fact, human survival 
has been hypothesized to be dependent upon vocal commu- 
nication and language in gregarious species where individuals 
depend upon the group for safety and protection (Monnot, 
2005). A few primary process emotions (anger, sadness, 
joy/playfulness, fear, disgust, and surprise) are recognized by 
individuals from diverse cultural backgrounds on the basis of 
facial expressions and vocal cues. Recent perceptual and 
acoustic analyses studies indicate that to varying degrees, 
one emotional state can be distinguished from another using 
pitch, duration of consonants and vowels, intensity, speech 
rate, and the means, bandwidth, and spectral shape of the 
first, second, third, and fourth formants. Reports on percep- 
tual classification of these same emotions ranges from 40 to 
60% (Bachorowski and Owren, 2003). These authors suggest 
that the use of acted rather than natural stimuli may result in 
emotional portrayals that do not closely correspond with 
naturally produced vocal expressions of emotion. 

Monnot, Orbelo, Riccardo, et al. (2003) have demonstrated 
that acoustic analyses of fundamental frequency (Fo) are posi- 
tively associated with subjective judgments of emotion and 
that detoxified alcoholics who could not vary Fp were unable 
to convey emotion accurately to communication partners. 
This is corroborated by studies of Fp and emotion demonstrat- 
ing that speech without Fo, such as whispering, is almost 
devoid of cues that enable the listener to detect emotion 
(Monnot, Orbelo, Riccardo, et al., 2003). However, not all 
recent research on voice and emotions is in agreement. Stud- 
ies by Kakehi (2005) designed to investigate the perceptual 
relationship among six basic emotional categories (happiness, 
fear, anger, disgust, sadness, and neutrality) found that the 
emotional perception of voice was not categorical. 

Similar results were reported by Morosomme and Verduy- 
ckt (2005) in a study attempting to compare adolescent with 
adult listeners’ perceived emotions (anger, fear, joy, sadness, 
neutral, and others) expressed by an actor and patient with 
progressive muscular atrophy. As in other perceptual studies 
of emotion, agreement was generally no more than 60%, and 
differences between adults and adolescents and between 
males and females were minimal. Perceptual accuracy is 
reduced when the stimuli presented have been filtered or 
presented in noise that masks semantic content and are 
influenced by talker sex and emotional traits and pursuit of 
social goals and context in which the sample is being pro- 
duced. Identification of emotions conveyed through voice 
might be increased by acoustic analyses that does not 
depend on visual cues and removes listener bias. In a Russian 
study of actors, recognition rate of a set of emotions (anger, 
irony, surprise, fear, annoyance, joy, shame, and neutral) was 
nearly 80% correct using a set of acoustic parameters to 
describe “prosodic emotions” (pitch, duration of consonants 
and vowels, intensity, speech rate, means of the first, second, 
third, and forth formants, bandwidth and spectral shape 
[Sidorova, 2005]). 

Campos, Thein, and Owen (2003) argue that one function 
of vocal emotional expression is to regulate behavior. In a 
mother-child dyad, these investigators manipulated excla- 
mations of anger, fear, or joy when an infant was approaching 


8 Psychogenic and Other Behavioral Voice Disorders 169 





an object and measured how the vocalizations influenced 
both looking to the mother and behavior toward the toy. Both 
anger and fear vocalizations produced significant response 
inhibition with little differences in infant’s emotional behav- 
ior. On the other hand, the emotional response to joy vocal- 
izations differed from anger and fear in that it produced 
minimal behavioral disruption of the infant’s progress toward 
the toy and minimal orienting to the mother. 

Because emotion links higher brain functions with lower 
brain functions (Lang, 1994) and the limbic system has 
strong connections to motor cortex and extrapyramidal tract 
(Davidson, 2000; Gray, 1991; LeDoux, 2000), it is no surprise 
that there is some connection between voice and emotions. 
The challenge is to accurately identify these emotions to 
facilitate determining etiology of voice problems and appro- 
priate treatment programs. 


+ Voice as an Indicator of 
Psychopathology 


Affective Disorders 


If voice reflects both personality and emotional states, one 
would assume that voice would also serve as an indicator of 
psychopathology. Indeed, clinicians have often speculated 
on a link between psychopathology and voice. Affective dis- 
orders are psychiatric syndromes characterized by alter- 
ations in mood, such as depression, manic or hypomanic 
states, euphoria, and mood swings. A depressed outlook 
either may be lifelong or can occur for the first time in a usu- 
ally cheerful person who unexpectedly becomes chronically 
sad, tearful, pessimistic, slow of thought, agitated, disinter- 
ested, unable to sleep and eat, and who senses a decline in 
feelings of self-worth. In one such group of 40 patients who 
demonstrated sadness, retardation of thought processes, 
anorexia, and insomnia, Newman and Mather (1938) described 
their voices as dead or listless, with narrow pitch range, 
infrequent pitch changes, slow rate, frequent pauses and 
hesitations, and reduced stress or emphasis patterns. In a 
second group that exhibited chronic gloom, self-pity, and 
dissatisfaction, speech patterns were different from the first; 
these had long gliding inflectional pitch changes over a wide 
pitch range, lively voices, normal rates but frequent pauses, 
harsh voice quality, and crisp articulation. 

Monotonous downward inflectional patterns have been 
found in depressed patients (Moses, 1954). Whitman and 
Flicker (1966) found that pitch became higher and loudness 
lower in proportion to the degree of depression. In agitated 
depression, they found an increased articulatory rate as 
patients became more depressed. In a study done by Harg- 
reaves and Starkweather (1964) in which acoustic spectro- 
graphic recordings were made during treatment of depression, 
as patients improved, their overall vocal intensity and their 
energy in the higher vowel formants increased. In 10 chroni- 
cally depressed patients, five showed reduced loudness and 
diminished inflectional changes. Moreover, patients in the 
manic phase of manic-depressive reactions produce a very 
different voice and articulation profile: press of speech; 


flight of ideas; vigorous articulation; clear, lively, and vital 
voice; wide pitch range; frequent gliding pitch changes; fre- 
quent emphasis and accent; and exaggerated pauses. As 
these patients become less manic, their pitch range narrows, 
emphasis patterns are reduced, pauses and hesitations begin 
to appear, and articulation becomes less vigorous (Newman 
and Mather, 1938). 

Psychiatrists and psychologists have long acknowledged 
that they can tell much about their patients’ emotional state 
and underlying psychodynamics by listening to their voices. 
The psychotherapist also uses his or her own voice in ways 
that will create a more effective relationship with the 
patient. Reminding us that virtually all methods of psy- 
chotherapy involve two or more persons talking together, 
Bady (1985) wrote: 


Since I have been listening to my patients’ voices I have also 
noted that changes in vocal quality accompany their 
progress. One patient achieved a lighter, less ponderous 
voice as he became more comfortable with his emotions. 
Another slowed down his rapid pace. A third achieved a 
less grating and more fluid tone. 


In her own use of voice as a means of making her therapy 
more effective, the author further states: 


In my work I will intentionally use my voice along with my 
words. Sometimes I attempt through vocal tones to soothe 
an anxious, agitated patient. Other times I use my voice to 
stimulate a depressed and hopeless one. On still other 
occasions I talk to give the patient a human response and 
my words are less important than the vocal indication of 
my presence. Sometimes I remain silent to encourage sepa- 
ration from me. Occasionally my voice backfires on me, as 
when a patient notices my anger or my anxiety through 
the sound of my voice. 


Noticeable differences between “peak” and “poor” therapy 
hours can be determined by an analysis of psychotherapists’ 
voices. In one study, the therapist’s voice during peak ther- 
apy sessions was described as more open, lower in pitch, and 
of softer intensity than during the poor therapy hours, the 
former giving the impression of warmth, seriousness, relax- 
ation, and closeness (Duncan, Rice, and Butler, 1968). 

Considerable controversy surrounds questions concerning 
if personality factors and psychologic adjustment should be 
considered causal, concomitant, or consequential to voice 
disorders. Roy and Bless (1999, 2000a,b) attempted to 
answer this question by comparing five groups of speakers, a 
vocally normal group and four groups of disorders, on meas- 
ures of personality and psychologic adjustment. The major- 
ity of functional voice disorders were classified as introverts 
and the majority of vocal nodules subjects as extroverts. 
Mirza, Ruiz, Baum, et al. (2001) found similar results in their 
study of 53 voice-disordered patients. They also noted that 
patients with vocal fold paralysis may be at risk for subse- 
quent psychiatric problems. 


Schizophrenia 


The study of voice of schizophrenics has a long history. 
Although the majority of literature has been based on 
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clinical reports and single-case studies, the similarities 
between observations suggest there may be distinct devel- 
opmental vocal characteristics in this population. 

In child schizophrenics, Goldfarb, Brownstein, and Lorge 
(1956) described hypernasality; breathiness; hoarseness; 
glottalization; flat voice quality; insufficient or inappropri- 
ate volume and pitch changes; narrowed total pitch range; 
tendency toward excessively high pitch; inappropriate rate 
changes from phrase to phrase; prolongation of phonemes, 
syllables, and words; chanting quality; insufficient or inap- 
propriate stress; hesitation after and repetitions of sounds or 
syllables; and flat, monotonous prosody. In another study, 
Goldfarb, Goldfarb, Brownstein, and Scholl (1972) noted the 
absence of a single specific clustering of these speech abnor- 
malities in child schizophrenics, with the basic disturbance 
being loss of control and of regulation of speech. 

In adolescent schizophrenia, Ostwald and Skolnikoff (1966) 
described a 15-year-old male’s speech as having a nasal qual- 
ity, articulatory imprecision or indistinctness, exaggerated 
intonational patterns, and a high and prolonged rise in pitch 
toward the ends of questions. As in childhood schizophrenia, 
there was excessive variability of rate and rhythm. Similarly, 
Chevrie-Muller, Dodart, Sequier-Dermer, and Salmon (1971) 
found reduced pitch range, reduced rate, and increased pause 
time in female adolescent schizophrenics. 

In an adult schizophrenic, melody or pitch patterns were 
found not to glide but to jump at intervals and without rela- 
tionship to ideational content. Accent (emphasis or stress 
pattern) was inappropriate, and speech contained rhythmic 
monotonous repetitions of vocal patterns (Moses, 1954). 
Moskowitz (1951) described the speech of adult schizo- 
phrenics as monotonous, weak, flat, colorless, and gloomy. 
Saxman and Burk (1968) found significantly slower reading 
rates in adult schizophrenics than in normal subjects. 


+ Voice as an Indicator of Life Stress 


Clinical and instrumental studies prove that otherwise 
normal voice perceptibly and measurably changes under 
emotional stress. A smooth, well-modulated voice signifies 
cortical control over the emotionally primitive, phylogeneti- 
cally older nervous system. During subcortical emotional 
release, phonatory and respiratory control disintegrates. 
Massive automatic fight-or-flight reactions prepare the 
organism for increased physical work—fixing the upper 
extremities to the thoracic cage for combat, requiring firm 
adduction of the vocal folds and wide abduction to facilitate 
an increased volume and flow of oxygen to meet the body’s 
increased metabolic demands. Such emergency physiologic 
states are incompatible with fine voice pitch, loudness, and 
quality control. 

Most group studies to determine the effect of stress on 
voice have induced stress by either exposing subjects to 
threatening stimuli, asking them to lie, or requiring them to 
perform difficult tasks. These studies have been criticized, 
however, because the stresses did not represent real-life sit- 
uations and therefore their intensity may not have been suf- 


ficient to induce measurable emotional responses. Male- 
female differences were sometimes not taken into account, 
and subjects have been noted to mask their voice reactions 
to emotion. Despite these objections, the results of a cross 
section of studies seem to have established that increased 
voice fundamental frequency (pitch) is directly related to 
increased stress. Aviators and astronauts in distress show 
increased fundamental frequency proportional to the 
amount of stress experienced. Simonov and Frolov (1973), 
reporting on the effects of stress and fatigue on the voices of 
cosmonauts, found that increased heart rate correlated posi- 
tively with increased intensity of the higher vowel formants. 
Voice pitch elevation corresponds directly with increased 
emotional tension in pilots, according to Khachatur’ yants 
and Grimak (1972), Popov, Simonov, Frolov, et al. (1971), and 
Williams and Stevens (1969). 

Voice patterns may also be indicative of personality type 
of speakers without voice disorders. Males with type A per- 
sonality were found to be seven times more likely to develop 
coronary artery disease than type B personalities (Friedman 
and Rosenman, 1959). The type A personality has such traits 
as excessive drive, ambition, aggressiveness, impatience, and 
a habitual sense of time urgency when challenged by the 
environment. Subsequently, Rosenman, Friedman, Straus, 
et al (1964) discovered that perceptually type A personalities 
had similar voice and speech patterns: explosive semiviolent 
accentuations in prosody that carried a “certain aggressive 
timbre” but that appeared only when that person was inter- 
ested in or excited about the subject under discussion. The 
type B personality spoke in an “unruffled, rather smooth 
manner without explosive or aggressive accents.” 

These observations were made at the time by clinical 
description and were not masked to the observer, introducing 
a possible observer bias. Subsequently, Friedman, Brown, and 
Rosenman (1969) completed an acoustic study of voices of a 
cohort of male subjects with type A or B personalities but free 
of coronary artery disease. To induce exhibition—the eager- 
ness to win or excel at a challenging activity and to release 
latent hostility—a two-paragraph diatribe was composed in 
which an officer exhorts his troops prior to battle. The subject 
was asked first to look over the paragraph in preparation for 
reading it into the microphone. The investigators reported 
that even prior to the experiment itself, the type A subjects 
(n = 19) blurted out the identical question “Do you want me 
to read this with expression?” whereas no type B subjects 
(n = 18) ever asked that question. Type B subjects had few or 
no vocal escalations, average noise-free period (45 seconds) 
nearly identical to the average total time (46 seconds), result- 
ing in an average index of 1.06, the ratio of the total reading 
time to noise-free time. The majority of type A subjects, 14 of 
19, exhibited several vocal escalations, had significantly less 
average noise-free periods than type B subjects, and a signif- 
icantly greater average index of 1.50. From this study, the 
authors concluded that voice analysis successfully segregated 
the majority of type A from type B personalities. 


Psychogenic Voice Disorders 


Numerous studies have suggested a link between stress and 
voice resulting in psychogenic-based voice disorders (Aronson, 
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Peterson Jr., and Litin, 1966; Bhatia, 2000; Dietrich and Ver- 
dolini, 2005; Jain et al, 2000; Lauriello et al, 2003; Mirza et al, 
2003; Roy and Bless, 2000; Roy et al, 2000). In the literature 
and in the clinic, the terms functional, psychogenic, psychoso- 
matic, and nonorganic are used synonymously in reference to a 
group of voice disorders that exist in the absence of organic 
laryngeal pathology. The label functional appears most often. 
However, from a survey of clinicians’ usage, Perelló (1962) 
found no fewer than eight interpretations of “functional voice 
disorder”: (1) no apparent alteration in structure detected by 
laryngoscopic examination; (2) negative laryngoscopic exami- 
nation but positive stroboscopic examination; (3) dysphonia 
disproportionately severe when compared with existing 
anatomic lesions or inflammations; (4) disorder of nervous 
origin with variable symptoms and conducive to the formation 
of organic lesions; (5) disorder reversible (as opposed to 
organic disorders, which are usually irreversible); (6) function 
is altered (the disorder disappears when the organ is used cor- 
rectly); (7) incorrect motor utilization (therefore, it is a dys- 
function); and (8) desire to mask ignorance of exact cause. 

Objection to the term functional mainly has to do with its 
ambiguity and lack of precision, especially in light of the fol- 
lowing realities: (1) There is more than one functional voice 
disorder. (2) Functional voice disorders not only sound dif- 
ferent but also occur for different reasons and, therefore, 
should be considered as separate subtypes. (3) Most sub- 
types can be traced to emotional or psychologic causes. More 
accurate, then, is the term psychogenic voice disorders. A psy- 
chogenic voice disorder is broadly synonymous with a func- 
tional one but has the advantage of stating positively, based 
on an exploration of its causes, that the voice disorder is a 
manifestation of one or more types of psychologic disequi- 
librium, such as anxiety, depression, conversion reaction, or 
personality disorder, that interfere with normal volitional 
control over phonation. 

Why do only certain persons react to environmental stress 
and interpersonal conflict by developing an abnormal voice? 
Why is it that others who have emotional reactions to stress 
and conflict never have voice problems? Why is it that still 
others express their personal problems through some organ 
system other than the larynx? No satisfactory answers to 
these questions have been found. Perhaps each person is 
predisposed by personality or physiologic makeup to hyper- 
react through a particular neuromuscular or visceral system. 
Those who are prone to develop voice disorders might be 
called laryngo-responders to designate their predisposition 
to developing laryngeal and voice disorders as their unique 
avenue for the expression of emotional distress. 

If everyone who was unable to express emotion developed 
an abnormal voice, not just certain predisposed persons, 
clinics would be overwhelmed by patients with psychogenic 
voice disorders. The distinguished director and writer Elia 
Kazan (1988) gives a casebook description of what it is like 
not to be able to express one’s feelings openly. To anyone’s 
knowledge, he never developed a psychogenic voice disor- 
der, but if he had, the following thoughts would have fit the 
disorder perfectly. 


When someone works in the arts, he works from craft, not 
emotion. I'd done play after play that I had no true feeling 
for, and I'd again and again suppressed the feelings I did 


have, choked them off, my hands pressing at my throat, sti- 
fling the scream that, if it could be heard, would be my true 
voice. 

“Speak now,” I said to myself, “release your true feel- 
ings before it’s too late. Be yourself. Take your place in the 
world. You are not a cosmic orphan. You have no reason 
to be timid. Respond as you feel. Awkwardly, crudely, vul- 
garly—but respond. Leave your throat open. You can have 
anything the world has to offer, but the thing you need 
most and perhaps want most is to be yourself. Admit 
rejection, admit pain, admit frustration, admit pettiness, 
even that; admit shame, admit outrage, admit anything 
and everything that happens to you, respond with your 
true, uncalculated response, your emotions. Work on it. 
Stir up the lump whenever you can. Raise your voice.” 


Based in part, on the work of Lazarus and Folkman (1984) 
Dietrich and Verdolini (2005) offer a psychoneuroimmuno- 
logic (PNI) framework to explain the interactions among 
cognitive, emotional, biological, and behavioral pathways in 
mediating stress-related voice responses. According to this 
framework, stress-induced voice changes occur when envi- 
ronmental demands exceed the individual’s abilities to cope 
with threat or challenge (Lazarus and Folkman, 1984). 
Chronic stressors generate physical disease by causing neg- 
ative affective states (e.g., anxiety, depression), which in 
turn affect biological processes that influence disease risk. 
Of particular concern to voice and the work of Dietrich and 
Verdolini are stress-induced biological processes involving 
the autonomic nervous system (ANS). Based on ANS litera- 
ture on other areas of the body, they speculate that acute or 
chronic stressors may increase muscle tone, increase or 
decrease blood flow, and decrease upper airway secretions. 
Effects on voice would involve muscle tension, capillary 
abnormalities, and sicca laryngeal conditions, which could 
influence laryngeal function (e.g., muscle tension dyspho- 
nia) and laryngeal structure (e.g., phonotrauma). This 
intriguing physiologic model provides a practical explana- 
tion of musculoskeletal tension disorders. 

An alternative nonlinear model of behavior described by 
Gustafson and Meyer (2003) to explain a case of panic 
appears to apply equally well to voice. They make several 
assumptions in their nonlinear model that are different than 
those models assuming that stress is cumulative; that when 
environmental demands exceed the individual's abilities to 
cope with threat or challenge a problem begins. Gustafson 
and Meyer assume that the dynamics bringing about a 
pathologic state are nonlinear, that the discontinuity can be 
expressed in two-dimensional space as a catastrophe where 
a patient pushes himself or herself from point (a) to (b) ina 
normal state of affect and suddenly finds himself or herself 
in high anxiety or panic (c) or collapsed into depression (d). 
What follow are further catastrophes or complications. Anx- 
iety that is sufficiently threatening to affect functioning 
leads to drastic compensations to bring it to manageable lev- 
els, such as hysterical somatization, or phobic avoidance, or 
borderline self-destructive acts (Gustafson, 1999). The 
patient is in a naive state in which her plan of action is unre- 
alistic (Gustafson, 1997, 2000) even though the strategy may 
have been successful in the past. For example, a child may 
have been successful in deflecting harm from abusive 
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parents by being particularly nice only to find out as an adult 
that the same behavior may lead to being taken advantage of 
by a spouse or employer and thus she finds herself in a catas- 
trophe of panic or collapse (Gustafson, 1986). It takes little 
imagination to see how this model applies to voice. 


Musculoskeletal Tension Disorders 


In the physically healthy individual, states of tension come 
about in two ways: (1) from exogenous sources, such as over- 
work, tense working conditions, professional worries, 
unhappy family life, or unfavorable situations of occupational 
placement such as poor ergonomic working postures, work- 
ing in noisy environments, or working around pollutants; and 
(2) from endogenous sources, such as peculiarities of person- 
ality structure that tend to induce tension (e.g., perfection- 
ism, compulsive attitudes, overambitious drives, lack of 
adaptability through inflexible rigidity, or uncontrolled out- 
bursts of anger). Under such conditions, tensions may build 
up without appropriate release. States of increased tension 
eventually lead to augmented nervous irritability. As a result 
of this continuously increasing state of neural reaction, the 
musculature is incited to higher tonus, in excess of the actual 
demands made on it (Luchsinger and Arnold, 1965, p. 15). 

One cardinal principle in clinical voice disorders is the fol- 
lowing: The extrinsic and intrinsic laryngeal muscles are 
exquisitely sensitive to emotional stress, and their hypercon- 
traction is the common denominator behind the dysphonia 
and aphonia in virtually all psychogenic voice disorders. It is 
unlikely that there is any single factor responsible for the 
voice problem. Anxiety, anger, irritability, impatience, frus- 
tration, personality, response to disease states, stress, and 
depression may all be contributing factors. On laryngoscopic 
examination, the vocal folds either are normal, are mildly 
inflamed, fail to adduct completely, hyperadduct, or are 
slightly bowed. In general, however, the extent of visible 
pathology is incongruously minor or absent in comparison 
with the severity of abnormal voice. 

Musculoskeletal tension can be detected manually by the 
clinician and can be sensed by the patient. The tension pro- 
duces elevation of the larynx and hyoid bone. The clinician 
tests for such tension by encircling the larynx with the 
thumb and middle finger in the region of the thyrohyoid 
space, feeling to determine whether the space has been 
narrowed by laryngeal elevation. The larynx and hyoid 
bone resist being manually moved laterally or vertically. 
When the larynx and hyoid bone are pressed and kneaded 
in the areas indicated, the patient will respond with dis- 
comfort or pain, sometimes quite severe, depending upon 
the degree and duration of the muscle tension. The pain 
will be either unilateral or bilateral, although more often it 
is unilateral. The normal larynx does not respond in this 
manner, so that discomfort and pain with normal-appear- 
ing vocal folds are most often diagnostic of laryngeal mus- 
culoskeletal tension. If the larynx is pulled down after the 
sensitive areas are kneaded, and if decreased hoarseness 
and breathiness and increased volume follow, such is proof 
that musculoskeletal tension was responsible for the dys- 
phonia. This test is also the main component of the therapy 
for these disorders. 


In the Conversion Aphonia section later in this chapter, it is 
stated that patients who whisper as a sign of a conversion 
reaction also exhibit extreme musculoskeletal tension, with 
the larynx suspended high in the neck and the entire hyoid- 
laryngeal sling remarkably stiff. The aphonic does not whis- 
per because of passivity of the musculature; on the contrary, 
the musculature is under supercontraction. This fact was 
demonstrated in the laboratory by Tarrasch (1946), who 
recorded electromyographic potentials from the sternoclei- 
domastoid and thyrohyoid muscles bilaterally during quiet 
respiration and attempts to phonate. She found considerable 
spasticity of the musculature in aphonic patients. Van den 
Berg (1962) cites Faaborg-Andersen’s electromyographic 
study of the intrinsic muscles of the larynx in normal sub- 
jects who whispered voluntarily. Even under such artificial 
conditions, increased electrical activity was found in the 
adductor laryngeal muscles. Similarly, in an x-ray tomo- 
graphic study done by Zaliouk and Izkovitch (1958), a patient 
with conversion aphonia showed hypercontraction as 
reflected in increased constriction of the supraglottic larynx. 

Not only does laryngeal musculoskeletal tension produce 
discomfort or pain when the examiner applies pressure in 
the regions described, but also patients volunteer informa- 
tion about spontaneous pain radiating to the ear, sternum, 
and midchest. They feel as if a foreign body were stuck in the 
throat and report difficulty in swallowing, constriction, and 
swelling or compression in the pharyngolaryngeal area. They 
have what is referred to as a globus sensation. Globus is 
reportedly related to psychogenic disorders and recently 
also has been strongly related to laryngeal esophageal reflux 
as described elsewhere in this book. Takahashi, Hinohara, 
Ohmori, et al. (1971) reported on 652 cases of a disagreeable 
sensation in the pharyngolaryngeal region and found a high 
incidence of neurotic tendencies in this group. Some had car- 
cinophobia after a close friend or relative had died of the dis- 
ease. They concluded that dysphoric affective states play an 
important part in producing these sensations. The relation- 
ship between dysphonia, musculoskeletal tension, and envi- 
ronmental stress is illuminated in the following case study 
without any known reflux disease. 


Case Study 8.1 


A 37-year-old single female, a registered nurse, noted a 
“cracking” of her voice 5 months prior to coming to the 
clinic, at first only a few times a week but becoming more 
frequent, so that by the end of 3 weeks she had become 
continuously hoarse. She consulted her family doctor, who 
thought that her dysphonia was due to postnasal drip and 
prescribed cough syrup, and, later, ampicillin without ben- 
efit. She was next examined by an otolaryngologist, who 
noted irritation of her vocal folds but did not offer a diag- 
nosis or treatment. Two months after onset, her voice 
returned to normal while she was on vacation in Mexico, 
but it worsened upon reentry to the United States. She 
went back to her family doctor, who prescribed Valium 
(Diazepam) and Decadron (Dexamethasone), which were 
also ineffective, and attempted to treat her disorder as if it 
were myasthenia gravis by prescribing Mestinon (Pyri- 
dostigmine), which also was of no benefit. 
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Indirect laryngoscopy showed normal vocal folds, and 
neurologic examination failed to disclose any evidence of 
myasthenia gravis or other neurologic disease. Her gen- 
eral medical examination was normal. 

Her voice was continuously strained, hoarse, and pro- 
duced with above-normal physical effort. On vowel pro- 
longation, she produced a clear voice at a higher pitch 
level. Her cough and glottal coup were normally sharp, 
indicating good vocal fold adduction. She reacted with 
discomfort and pain, grimacing in response to pressure 
over the posterior region of the hyoid bone bilaterally. Her 
larynx, at rest and on phonation, was positioned high in 
her neck, close to the hyoid bone. Her voice improved 
momentarily when the larynx was moved downward by 
the examiner during vowel prolongation. Asked if her dys- 
phonia could be a reaction to emotional stress, and asked 
to describe any upsetting events that might have 
occurred surrounding the onset and development of her 
dysphonia, she became momentarily silent, frowning, as 
if taken off-guard by the question. After a few moments’ 
reflection, she smiled and, as if it were occurring to her for 
the first time, said that in fact, she had been struggling 
with a distressing situation. Shortly before her dysphonia, 
a local newspaper editorial had appeared that was critical 
of the costs of the patient’s position as a public health 
nurse and that argued for the elimination of her job. A bit- 
ter conflict broke out between her department and the 
editor, with accusations flying in both directions. Caught 
in the middle, the patient grew increasingly angry over 
the unfairness of the allegations, yet had no means of 
responding to them. 

As she disclosed this information, her frustration and 
anger became apparent. Her face flushed, and although 
on the verge of tears, she continued to talk. During the 
next few minutes, her voice spontaneously returned to 
normal. She was gratified and surprised at this sudden and 
unexpected improvement. She said she had suspected 
that her voice disorder had something to do with the con- 
flict at home but had not appreciated its importance until 
the question had been put to her directly. She ought to 
have suspected the connection, she said, when her voice 
had begun to improve after a resolution of her job prob- 
lem shortly before coming to the clinic. 


This case illustration is not atypical. Consultation with 
several specialists before arriving at a diagnosis and treat- 
ment of the disorder is typical in patients who have muscu- 
loskeletal tension dysphonia. Perceptually undramatic and 
indistinguishable from hoarseness due to other causes, mus- 
culoskeletal tension dysphonia is treated erroneously as if it 
were laryngitis. Diagnostically important in the cited case 
were the strained voice, tightness, hoarseness, history of 
voice return while on vacation, and pain in response to pres- 
sure in the region of the larynx. The diagnosis was confirmed 
when the voice returned spontaneously as the patient venti- 
lated her feelings. Loosening of her character armor, physical 
relaxation, abandoning of defenses, and freeing of affective 
expression through weeping or laughter are diagnostic and 
therapeutic in such cases. 


Vocal Abuse 


Hyperadduction of the vocal folds occurs when there is 
increased closing force, or adduction time, in the glottal 
cycle. It can occur as a primary problem resulting in a sec- 
ondary pathology or as a compensatory movement second- 
ary to an underlying pathology. A problem for clinicians is 
determining which came first: the chicken or the egg. The 
vocal consequences of hyperadduction perceptually range 
from complete aphonia to a mildly hoarse voice. The patho- 
logic consequences of hyperadduction of the vocal folds are 
typically one of three kinds of secondary laryngeal patho- 
logic states: (1) inflammation and edema, (2) vocal nodules 
or polyps, or (3) contact ulcers. There is a tendency to think 
of these as vocal abuse disorders because they are often 
associated with strenuous speaking, singing, yelling, scream- 
ing, coughing, and throat-clearing. But abuse or misuse is too 
simple an explanation for these complex disorders. These 
hyperadduction voice disorders may be the consequence of 
lingering compensatory strategies adopted during bouts of 
laryngitis secondary to upper respiratory infections (URI), 
they may be related to personality traits and associated 
responses to excessive environmental demands or responses 
to illness, they may be an interaction with tissue irritation 
from laryngeal-esophageal reflux, or they may be an inter- 
mediate link in the chain of causes that begins with an emo- 
tionally determined impetus to vocalize aggressively. 
Regardless of the cause, the effect of such behavior is to pro- 
duce traumatic lesions, leading many clinicians to categorize 
vocal nodule or contact ulcer as organic voice disorders. But, 
from a fundamental etiologic point of view, they are proba- 
bly better characterized as functional or behavioral, because 
the lesions were the result of abnormal speaking behavior, 
lesions are not always present, and they are often motivated, 
or at least compounded, by personality or emotional factors. 

Across the life span, the effects of vocal abuse exist on a 
severity continuum. School-age children who abuse their 
voices will have either normal-appearing vocal folds, 
inflammation without discrete lesions, or discrete lesions. 
Senturia and Wilson (1968) reported on 32,500 children 
aged 5 to 18 years and found that 1962 (6%) had abnormal 
voices. Of that abnormal group, 338 aged 6 to 11 years were 
given voice, hearing, psychometric, and laryngologic exami- 
nations. The males predominated over females by a ratio of 
2:1. Of these voice-defective children, 147 were given laryn- 
goscopic examinations, and 47.5% were found to have vary- 
ing degrees of redness or edema or both in the region of the 
arytenoid cartilages. Interestingly, redness or edema or both 
in the region of the arytenoid cartilages is one of the cardi- 
nal signs of laryngeal esophageal reflux (Koufman, 1991). 
On indirect laryngoscopy in these 147 children, only 92 had 
vocal folds that could be completely visualized. Of those 92 
children with abnormal voice, 63 had discrete lesions such 
as nodules, hyperplasia, and diffuse or localized edema; 
again, conditions often attributed to laryngeal esophageal 
reflux with or without accompanying vocally abusive 
behaviors. The predominance of lesions in males compared 
with females was almost 3:1 in children having discrete 
vocal fold lesions and more than 2:1 in those not having dis- 
crete lesions. As the vocal folds lengthen during puberty 
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more for males than females, this ratio is reversed because 
the impact stress of vocal fold contact is distributed over a 
larger area in males and concentrated in females at the mid- 
membranous portion of the vocal folds. The voice signs 
within this group of 92 children consisted of whispering, 
breathiness, and stridency. 

The amount and type of classroom voice use of children 
diagnosed as having hoarse voices was compared with that 
of children who had normal voices by Barker and Wilson 
(1967). The length and loudness of utterances were tabu- 
lated for 14 children in the control group (normal voice) 
and for 14 in an experimental group (hoarse voice). The 
researchers found that the children with normal voices pro- 
duced an average of 20.92 vocalizations in a 2-hour period 
compared with 61.50 vocalizations by the children with 
hoarse voices within that same time period. The difference 
between the two groups was significant beyond the 0.01 
level. Of interest is the observation that the children with the 
abnormal voices were more active within the classroom dur- 
ing unstructured periods than were the children with nor- 
mal voices. These indications of extraversion are consistent 
with Roy and Bless (1999) data on adults suggesting individ- 
uals with vocal nodules are often extraverted and have an 
inability to inhibit and consequently might be more likely to 
talk more and louder than their introverted counterparts 
who developed functional voice disorders with more open 
glottal postures. 

The theme of personality aggressiveness, hyperactivity, 
emotional reactivity, and family problems in dysphonic chil- 
dren with and without discrete vocal fold lesions is of con- 
siderable importance in understanding the factors responsi- 
ble for vocal abuse. Barker and Wilson, who were social 
workers, investigated the family structure of both the chil- 
dren with voice problems and those without. Of 153 who 
were classified as having abnormal voices, 65% came from 
homes in which there was excessive family conflict, whereas 
35% came from homes in which there was considered to be 
none. Of 97 children whose voices were normal, only 35% 
came from conflict-ridden homes (i.e., almost twice as many 
children with abnormal voices came from home environ- 
ments that were considered pathologic). 

One cause of vocal abuse after puberty is cheerleading. 
Many studies have proved that the result can be chronic dys- 
phonia with or without vocal fold damage. Jensen (1964), for 
example, found that 12% of 377 cheerleaders were dyspho- 
nic. Andrews and Shank (1983) found that 37% of 102 high 
school cheerleaders had a history of voice problems. Reich, 
McHenry, and Keaton (1986) surveyed 146 female cheerlead- 
ers ranging in age from 15.5 to 18.6 years and found that, at 
one time or another, 32% had experienced acute aphonia and 
86% complained of dysphonia. The average number of times 
that these students became aphonic during a given season 
was 4.2 and dysphonic 7.3. Cheerleaders present a particu- 
larly interesting group because they are generally believed to 
be extraverts, and cheerleading engages them in vocally abu- 
sive activities. 

After high school, occupation scores as one of the major 
etiologic factors in abuse. Teaching ranks among the highest 
of those occupations identified as vocally abusive. Teaching 
is demanding, causes emotional stress, and requires consid- 


erable talking. The amount of talking is greater in the lower 
elementary grades where teachers are thought to be at 
greatest risk for developing voice disorders. Acoustic analy- 
sis of teacher talking patterns has demonstrated that teach- 
ers in grades K through 3 talk louder, longer, and at higher 
pitch (Roy, et al, 2004). 


Vocal Nodules 


Vocal nodules are associated with vocal misuse or abuse. The 
nodule is a small, white or grayish protuberance on the free 
margin of the vocal fold at the junction of the anterior and 
middle third. It is a tissue reaction to frictional trauma 
between the folds, a growth of hyperkeratotic epithelium 
with underlying fibrosis. The nodules may begin with a sub- 
mucous hemorrhage and may develop from the fibrosis of an 
organizing hematoma. In the very early stages of vocal 
abuse, there may be nothing more than a localized capillary 
hemorrhage, swelling of the vocal fold edges, and redness of 
the folds. Later, fibrosis and the fully developed, roughly 
semicircular, grayish or white nodule form. 

Vocal nodules represent a degeneration of the lamina pro- 
pria with fibrosis and edema. Both the visual appearance and 
histologic nature of nodules is controversial though it has 
been suggested that acute vocal nodules are morphologically 
different from chronic. In the acute phase, the squamous 
epithelium is normal but covers an edematous stroma with 
thin-walled blood vessels, loose fibrous tissue, and lympho- 
cytes. In the chronic phase, the nodules possess a thickened 
epithelium and demonstrate acanthosis, keratosis, and fibro- 
sis with minimal edema of the underlying connective tissue. 

Three stages of vocal nodule formation have been de- 
scribed: (1) a local accumulation of fluid in the subepithelial 
layer of the vocal folds; (2) an organized inflammatory 
response with accumulation of protein and increased vascu- 
larity; (3) further organization of the lesion with fibrosis and 
possibly keratosis of the epithelium (Arnold, 1962; Gray, 
1991). Nodules vary in size from no larger than a pinhead to 
as large as a split pea. Early nodules may appear red, gelati- 
nous, and floppy, whereas chronic nodules are typically 
white and conical or hemispherical and appear hard and 
fixed to the underlying mass of the mucosa (Vaughn, 1982). 

The nodule is located at the midmembranous portion of 
the vocal fold because it is the point of maximum impact 
stress, or trauma, to the vocal fold. In cases of loud phona- 
tion, the impact stress is greater because of the larger sub- 
glottal pressures displacing the tissue to a greater extent, 
and in cases of singing, or high-pitched productions, the 
vocal fold collisions occur more frequently. After the nodule 
has been formed, the resultant dysphonia is perceived as 
breathy, husky, or foggy in quality, with a tendency toward 
low pitch. Interestingly when the fundamental frequency of 
the pitch is determined, the frequency is not lower com- 
pared with age-matched controls. The perception of lower 
pitch comes from the combination of the qualities of breath- 
iness and hoarseness. The dysphonia results from the 
increased vocal fold mass and change in the vibratory pattern 
particularly in the hourglass-shaped closing pattern. As the 
vocal folds project into the glottis, they cause air-stream tur- 
bulence and prevent the vocal folds from adducting anterior 
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and posterior to the lesion. The breathy or husky quality is 
the effect of failure of vocal fold closure in the nonadducted 
areas around the nodules. 

Contributing further to the notion that children who 
develop vocal nodules may differ in personality and family 
history from those who do not, Nemec (1961) observed that 
children with hyperfunctional dysphonia, including vocal 
nodules, were more aggressive and less mature and had more 
difficulty in managing stressful situations than children with 
normal voice. Mosby (1967) studied 25 children, 16 with 
vocal nodules and 9 with other types of dysphonia, by means 
of a battery of psychologic tests. He found a high incidence of 
neurotic personality conflict, overrepressed aggression, feel- 
ings of inadequacy, poor relationships with parents, and 
severe dependency needs. Wilson and Lamb (1973), in a 
study of 12 children with vocal nodules and 12 with normal 
voices, concluded that children with vocal nodules had per- 
sonalities that included marked aggressiveness, lack of con- 
trol, or passive, overcontrolled adjustment to aggression. By 
administering the California Test of Personality, Glassel 
(1972) compared 15 children, aged 7 to 11 years, who had 
vocal nodules with 15 who had normal voices. He found that 
the children with nodules scored significantly poorer on all 
measures of personality adjustment; they had reduced feel- 
ings of belonging, a reduced sense of personal worth, with- 
drawal tendencies, and antisocial feelings. However, not all 
children with nodules were vociferous according to the pop- 
ular stereotype, although they manifested the same degree of 
neurotic tendency as those who were. In a study of 77 chil- 
dren with vocal nodules, Toohill (1975) found that the par- 
ents of 62 children with nodules described their children as 
screamers, incessant talkers, or loud talkers. Sixty-six were 
described as having one or more of the following personality 
traits: aggressiveness, hyperactivity, nervousness, tenseness, 
frustration, or emotional disturbance. 

Further suggestion of psychoneurotic tendencies in chil- 
dren who develop vocal nodules comes out of a study by 
Green (1989). She had the mothers of 30 children with vocal 
nodules and 30 control mothers rate their children’s behav- 
ior on a standardized checklist of abnormal behaviors called 
the Walker Problem Behavior Identification Checklist. The 
children with vocal nodules, 22 of whom were male and 8 
female, ranging in age from 3 to 12 years, received a substan- 
tially higher percentage of abnormal ratings than the control 
group. More than 60% of the children with vocal nodules 
were rated as positive on the following traits: (1) temper 
tantrums when the child cannot get his way; (2) argues and 
wants to have the last word; (3) does not obey until threat- 
ened; (4) complains of others’ unfairness. Also rated high 
was the tendency toward overactivity and restlessness and 
the need for approval for tasks attempted. Not more than 
20% of the children with normal voices received ratings for 
these behaviors. From this study, the author concluded that: 


The concept of the development of vocal nodules in chil- 
dren has traditionally conjured a fixed mental picture of 
vocal abuse in a vociferous child. Current research suggests 
that such a focus is too simplistic and that the people work- 
ing with children with vocal nodules may need to consider 
a multiplicity of etiological and management variables 


including acid reflux, small vocal folds, laryngeal 
microwebs, personality, family, and classroom dynamics. 


Vocal nodules also have been found in association with 
velopharyngeal insufficiency independent of any particular 
personality traits. The need for higher intraoral pressures 
and for increased loudness causes overdriving of the vocal 
folds. (See Chapter 4 for a more detailed discussion.) 

The etiology and pathophysiology of vocal nodules in 
adults are basically the same as in children, with minor vari- 
ations. As previously noted, whereas the incidence of vocal 
nodules is higher in male children than in females, this trend 
is reversed in adults. Singing and occupation as primary or 
contributory causes are strong in adulthood. It is also likely 
that these females’ vocal folds have shorter vibratory lengths 
so that the impact stress at the midmembranous region 
where nodules occur is greater. Additionally, adult females 
with nodules have some of the same personality characteris- 
tics and daily habits as children who develop nodules. They 
are talkative, socially aggressive, and tense and have acute or 
chronic interpersonal problems that generate tension, anxi- 
ety, anger, or depression. Even when the nodules appear to 
be the sole result of abuse from singing or other strenuous 
vocal activity, it is often found that these were not the only 
factors responsible for the vocal overuse; these patients had 
often also entered a period of their lives in which concomi- 
tant emotional stress had surfaced. 

A review of a few case histories of individuals with vocal 
nodules highlights both the similarities and differences 
often seen between cases. 


Case Study 8.2 


A 63-year-old married female was referred by the Depart- 
ment of Otolaryngology with a diagnosis of bilateral small 
vocal nodules. She had begun to complain of hoarseness 
8 years previously whenever she had to use her voice 
strenuously, such as over background noise or during ani- 
mated conversation. She had been seen by laryngologists 
and speech clinicians who had made the diagnosis of 
vocal abuse and given “exercises,” which were ineffective. 
Aside from her voice difficulties, her health had been 
good, although she complained of general fatigue with 
prolonged voice use. 

On examination, her voice was nearly normal; there was 
a slight hoarseness consistent with the size of the nodules 
observed by the otolaryngologist. Complete motor 
speech examination failed to disclose any evidence of 
dysarthria, and the remainder of the examination of the 
speech structures failed to disclose any other defects. 

While giving the social-psychologic history, the patient 
said she believed that her husband was responsible for her 
voice problems. She described him as a voluble speaker, 
loud and animated. The patient, during most of her mar- 
ried life, had to raise her voice to compete with him during 
discussions, not only when they had company but also 
when they were alone together. “He dominates the con- 
versation, that’s his personality, his nature, you’re never 
going to change that... he is a shouter and | am a quiet 
talker.” Despite this marriage-long contrast in their 
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personalities, her voice troubles first began when she 
became interested in activist politics and needed to 
express her opinions in a more forthright manner. She vol- 
unteered that she believed she had gotten into the habit of 
using her voice excessively and with a great deal of tension 
as a reaction to her husband’s vocal manner. “Now, even 
when | have to talk to a group, | tense up.” When her voice 
was bad, she complained of shooting pains from her larynx 
to her ears, and on examination she had pain in response 
to pressure over the hyoid region and thyroid cartilage. 
Joint discussion with the patient and her husband con- 
firmed her testimony; he proudly confessed to being 
“vociferous, aggressive, dominating,” more concerned 
with his own opinion than hers. He described her as a 
“sounding board” and not nearly as concerned with her 
opinions as with his own. “I know it’s selfish and cruel. 
she’s a very tranquil individual and | am not. I’m the sec- 
ond last angry man!” She said to him, “You are a monopo- 
lizer of conversations, my main complaint is when there 
are other people in the conversation he interrupts them, 
and won’t let them finish. He’d rather talk than do any- 
thing else. Unfortunately, it becomes a monologue.” 


What emerges from this examination is not only a basic 
difference in how these two individuals communicate but 
also an even deeper disagreement as to what ought to be 
said and what is best left unsaid. Not only is the psychogenic 
etiology of this woman’s vocal nodules substantiated by an 
in-depth interview as a contributing etiology, but the results 
of that interview also point to the requirement for more than 
symptomatic voice therapy. 

The balance between vocal abuse and emotional stress is 
different in each patient with vocal nodules, and it is often 
difficult to separate the two. Whereas in the previous study, 
emotional and personality factors were prominent, vocal 
abuse is more apparent in the following case study of a farm 
wife, although the patient is not free of stress. 


Case Study 8.3 


A 39-year-old farm wife and mother of three boys had 
sung most of her adult life for weddings, funerals, and 
baptisms. One year before coming to the clinic, she had 
hoarseness occurring intermittently but did not consult 
anyone until 7 months after onset. Laryngologic examina- 
tion showed a thickening of the mucosa of the left vocal 
fold and a small ectasia on the posterior third of that fold. 
Laryngologic examination at the time she presented to 
the clinic, 1 year after onset, showed clear evidence of 
bilateral vocal nodules at the junction of the anterior and 
middle third of the vocal folds. 

The patient was a pleasant, energetic, healthy house- 
wife and mother of a highly musical family. In addition to 
her heavy singing schedule, her daily talking habits were 
prodigious; teaching, singing, talking, using her voice all 
day long with her children and her neighbors, and speak- 
ing on the telephone. 


Special cases of vocal nodules in adults occur in the culture 
of the pop singer, whose voice usage is often one of extreme 
abuse with or without coexisting fatigue and emotional 
stress. Greene (1972) had these interesting comments about 
this subgroup in Britain. 


Pop singers claim an immensely important role in enter- 
tainment, although professors of singing in the classical 
style do not recognize their vocalization indeed as 
singing. The senior coach of the Covent Garden Opera 
Company has said he would not consider them seriously 
as candidates for the course. Their members are legion 
and they give pleasure to millions. They are largely 
untrained, having musical sense and personality and 
pleasing voices but some, it must be admitted, have little 
natural vocal endowment. For many it is a short life but 
a lucrative one and when the voice is lost through vocal 
abuse they drop out of “show biz.” 

The vocal gymnastics and tricks peculiar to each singer, 
the double notes, acute pitch changes, the scoops and 
swirls have to be executed against the deafening competi- 
tion of amplified music. Despite the use of a hand micro- 
phone they must still “belt it out” at the top of their voices. 
The sound-pressure levels created by the bands and rock 
and roll groups are so great that they exceed the safety lev- 
els for avoidance of hearing impairment. 

The common run of cabaret and pop-group singers per- 
form for many hours on end starting at midnight and going 
on into the early hours. When not singing they often are 
smoking and talking in a dry, hot, and polluted atmosphere. 
They frequently eat late and have other lifestyle habits that 
promote laryngeal esophageal reflux. Consequently, it is 
not surprising that chronic laryngitis and vocal nodes are 
common in this population. 


Case Study 8.4 


A 19-year-old nightclub singer complained of intermittent 
hoarseness for 3 to 6 months. She was a backup vocalist in 
an eight-piece brass band and sang 6 days a week. Laryn- 
gologic examination revealed moderate-sized vocal nod- 
ules bilaterally at the junction of the anterior and middle 
third of her vocal folds. 

In addition to the close relationship between singing 
and dysphonia—the patient complained of less voice trou- 
ble when on vacation—she disclosed recent life stresses 
concerning problems with a boyfriend in the band. At the 
same time, she began to examine her career goals, sum- 
ming up her feelings over the past 6 months by saying, 
“I’m feeling a lot of emotional stress I’ve never felt before. 
Just seems like everything is coming down at once.” She 
admitted to being depressed during the time of her voice 
disturbance and personal problems. 


The foregoing case studies are suggestive that multiple 
factors may contribute to the etiology of vocal nodules, to 
which personality traits may contribute. 
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Contact Ulcer 


The second vocal abuse-musculoskeletal tension—-emotional 
stress voice disorder having secondary traumatic tissue reac- 
tion is contact ulcer, or contact ulcer granuloma. Instead of 
epithelial tissue developing, as in vocal nodule, a contact 
ulcer is an erosion of the mucosa. Occurring unilaterally or 
bilaterally, the ulcers are found at the junction of the middle 
and posterior third of the vocal folds, the intercartilaginous 
region, at the tips of the vocal processes of the arytenoid car- 
tilages. The hard cartilaginous undersurfaces of the vocal 
processes strike the mucosa of the opposite vocal fold, caus- 
ing hyperemia, the formation of granulation tissue, and, 
finally, an ulcer crater with its raised, inflamed margins. 

Contact ulcer is generally thought to be a consequence of 
laryngeal esophageal reflux or laryngeal intubation trauma. 
Since reflux has been identified as a primary etiologic factor, 
no definitive incidence or prevalence studies have been com- 
pleted. It is a disorder primarily of adult males in their 40s; 
adult females are affected less frequently, and children infre- 
quently. Male susceptibility may be due to the male ten- 
dency to adopt a low-pitched voice, moving the locus of 
force of vocal fold contact posteriorly. Ulcerations in the 
region of the arytenoid cartilage can also result from trauma 
due to laryngeal intubation. 

Clinical and research studies conducted 50 years ago 
pointed to certain common characteristics among adult 
males who developed contact ulcer (Peacher, 1947; Peacher 
and Hollinger, 1947): 


hypertonic laryngeal musculature as a component of gen- 
eralized musculoskeletal tension 

habitual use of an excessively low voice-pitch level 
explosive speech stress patterns 

sharp, abrupt glottal attack (coup de glotte) 


restricted pitch variability 
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phonation using excessively high infraglottal pressure 
with bursts of intensity 


Some patients with contact ulcer complain of discomfort 
or pain coming from deep within the neck radiating to the 
lateral neck area or ear, a tickle in the throat, an urge to clear 
the throat, a lump in the throat, and an aching or dryness. At 
one time, the classic profile of the contact ulcer patient was 
thought to be a male in his 40s who uses his voice inten- 
sively in his daily life and is either a lawyer, teacher, minister, 
actor, or salesman. In personality, he is tense and hard- 
driving and is often under chronic stress. Cigarette smoking, 
alcohol consumption, exposure to air pollutants, and 
extremes of air temperature and humidity are contributory 
factors. Smoking and alcohol are both considered causative 
factors in reflux, and all of the factors contribute to tissue 
irritation, which when coupled with impact stress may 
result in ulcerations. 

Musculoskeletal tension secondary to emotional stress is 
also considered to be a predisposing factor responsible for 
the mechanically damaging hyperadduction of the vocal 
folds, and some believe that ANS effects are responsible for 


vasoconstriction of the laryngeal mucosa, reducing blood 
supply to that region and thereby increasing its susceptibil- 
ity to trauma. As previously mentioned, recently this profile 
of causative factors has been expanded to include laryngeal 
esophageal reflux. 

The dysphonia of the patient with a contact ulcer is char- 
acteristically low pitched, hoarse, and grating. Not only does 
chronic vocal abuse produce contact ulcer, but sudden trau- 
matic yelling or shouting can also precipitate it. It often 
occurs after laryngitis, particularly when the patient has 
continued to abuse the voice during a throat infection. The 
following studies are typical of contact ulcer patients. 


Case Study 8.5 


A 43-year-old steel plant supervisor had an 11-month his- 
tory of laryngitis, during which he developed a contact 
ulcer. His dysphonia had become chronic 4 months after 
he had begun working in the machine shop of his plant, 
during which time he used his voice constantly, having to 
raise it to high loudness levels to be heard over the envi- 
ronmental noise. This environment was also dusty, hot, 
and dry in the summer and cold and damp in the winter. 
Moreover, he sang in a male chorus, and he complained of 
long working hours and general fatigue. On examination, 
his voice was hoarse and low in pitch with some acoustic 
evidence of breathy air escape. 


A 41-year-old professor of English literature had a 
5-year history of episodic hoarseness in association with 
lecturing, which became worse during resumption of a 
regular teaching schedule and assumption of new duties 
as a curriculum supervisor. Indirect laryngoscopy showed 
a contact ulcer granuloma on the posterior third of the 
left vocal fold. His voice was hoarse and excessively low in 
pitch and gave indications of breathy air escape. He 
described his style of lecturing as “animated” and “force- 
ful,” with a tendency to use an unnaturally low pitch. He 
perfectionistically prepared his lectures and delivered 
them under conditions of anxiety and tension. He found 
that his voice would tire toward the end of each lecture, 
and he would feel muscular tension in his neck and pain in 
his throat, sometimes expanding into headache. 


A 48-year-old automobile plant manager gave a 
2-month history of hoarseness with upper respiratory 
infection. Indirect laryngoscopy showed bilateral contact 
ulcer granulomas at the bases of both vocal folds. His voice 
was hoarse with excessively low pitch. Vocal abuse and 
emotional stress coincided with manufacturing problems, 
his dysphonia developing shortly after spending 6 hours a 
day in the shop talking over noise levels approaching 100 
dB. Much of his speaking had to do with reprimanding 
employees for inefficiency, and at the same time, he was 
regularly involved in public speaking in civic organizations. 


A 43-year-old radiologist presented with a 6-month his- 
tory of hoarseness after an upper respiratory infection. 
Video endoscopy revealed a unilateral contact ulceration 
on the left vocal process. His voice was somewhat asthenic 
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and his vocal range significantly reduced. History of vocal 
abuse was negative. He denied excessive alcohol intake 
and did not smoke. He had a positive history of esophageal 
reflux for which he was making lifestyle changes and tak- 
ing PPls (Priton Pump Inhibitors) without noticeable 
change in symptoms. 


Ventricular Dysphonia 


Closely allied to musculoskeletal voice disorders is ventricu- 
lar dysphonia, or dysphonia plicae ventricularis, character- 
ized by a continuously strained, harsh, or hoarse low- 
pitched voice described as rattling, rumbling, cracking, and 
ticker-like. During ventricular dysphonia, both airflow and 
frequency are abnormally low and the true vocal folds can- 
not be visualized. To some, ventricular dysphonia gives the 
impression of phonating during strangulation, and to others 
it has a groaning, animal quality suggesting exertion of 
extreme effort. The voice disorder derives its name from the 
fact that voice is produced by vibration of the false or ven- 
tricular vocal folds rather than the true folds. Its etiology is 
often unclear. Because it has been noted to occur in muscu- 
larly tense people and also because it is extreme hyper- 
function, it is probably best classified as a musculoskeletal 
tension disorder. It is important to note that many patients 
who have organic disease of their true vocal folds, which are 
incapable of producing adequate sound, compensate by 
using their ventricular folds instead. 


Conversion Voice Disorders 


This group of psychogenic voice disorders originates from 
the psychoneurosis known as a conversion reaction or con- 
version disorder. A conversion disorder is any loss of volun- 
tary control over normal striated muscle or over the general 
or special senses as a consequence of environmental stress 
or interpersonal conflict. The criteria for conversion reac- 
tions are that they: 


are specific physical symptoms or syndromes that cannot 
be traced to any anatomic or physiologic disease 


è are unconscious simulations of illness, which the patient 
is convinced is of organic origin 


serve the psychologic purpose of enabling the patient to 
avoid awareness of emotional conflict, stress, or personal 
failure that would be emotionally intolerable if faced 
directly 


è can occur in any sensory or voluntary motor system 


Examples of sensory conversions are loss of general sensa- 
tion in response to touch, pressure, or pain, or impairment of 
the special senses of vision or hearing. Motor conversions 
take such forms as weakness, incoordination, complete loss 
of movement control (“paralysis”), or unusual or bizarre 
movements anywhere in the body. 

The word conversion was first used by Freud to explain a 
mechanism by which an unbearable idea is rendered 
innocuous by having its energy transmuted into some bodily 


form of expression (i.e., conversion is a theoretical concep- 
tion of a clinical finding). Sensory or motor loss in the 
absence of organic disease is a defense mechanism against a 
threat from the environment. It is conjectured that somehow 
the psychic energy generated by that threat is transmuted or 
converted into a somatic sign, and, as in conversion voice 
disorders, the somatic sign is symbolically related to certain 
specifics about the threat or conflict that created it. 

The idea that psychic energy can be transformed into 
physical or “organic” dysfunction is difficult to grasp. Freud’s 
theory of “libidinal” conversion to physical dysfunction has 
been the subject of much criticism because of its metaphor- 
ical quality. It becomes more acceptable, however, if mod- 
ernized to mean symbolization of the conflict or unbearable 
idea. In their contemporary interpretation of the meaning of 
conversion reaction, Ziegler and Imboden (1962) wrote the 
following clarifying explanation: 


We have found it useful to consider the patient with a con- 
version symptom as someone enacting the role of a person 
with “organic” illness, symbolically communicating his dis- 
tress—dysphoric affect and/or unacceptable fantasy—by 
means of somatic symptoms. In our conceptual model, this 
somatic mode of communication does not serve to “dis- 
charge” pent-up emotion but, rather like any other language, 
it is useful as an instrument in negotiating interpersonal 
transactions. Through the conversion reaction, the fact that 
the patient is in distress is formulated to himself and com- 
municated to others in the egosyntonic terms of “physical 
illness,” and the patient thereby distracts himself (with 
varying degrees of success) from the more immediate per- 
ception of his dysphoric affect. Human beings may commu- 
nicate their feelings and ideas to themselves and others in a 
variety of modes such as ordinary consensual language, 
sign language, dreams, autistic verbal symbols of schizo- 
phrenia, or autistic somatic symbols of conversion reaction. 
Conversion may be viewed as operating, in this way, like 
other psychological processes. In many conversion reac- 
tions seen clinically, the message of emotional distress is 
communicated primarily in nonverbal ways (to the patient 
himself as well as the others in his transactional field) in 
terms of somatic symptoms which are, in effect, an analogic 
code. The patient unconsciously chooses particular symp- 
toms according to his conception of illness, as derived from 
his own past experiences with illness or from his observa- 
tion of others. His particular symptoms will then simulate 
physical illness in a relatively expert or relatively crude 
manner, depending upon the degree of congruity between 
his imagery of illness and that of the observer. Within this 
context of unconsciously simulated illness, specific symp- 
toms may develop or receive prominence because they are 
especially suited to the symbolic representation of specific 
fantasies, affects, and motivational conflicts. 


Sometimes hysteria is used in place of conversion reaction 
(e.g., hysterical voice disorder, hysterical aphonia, or hysterical 
dysphonia), in which case hysteria and conversion are used 
interchangeably and synonymously. Technically, this failure 
to differentiate the two terms is incorrect. Conversion refers 
to somatization of an emotional conflict, as elaborated previ- 
ously, whereas hysteria refers to a personality type or behav- 
ior pattern. Criteria for hysteria are (1) an immature or 
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egocentric person who has a propensity to develop subjec- 
tive complaints involving various parts of the body, prompt- 
ing that person to consult many physicians and to submit to 
a succession of operations. (2) In the case of a female, one 
who is seductive or flirtatious, sometimes hostile or manip- 
ulative, and who accents or highlights her sexuality by her 
manner, voice, dress, and gait, thus revealing an insecurity in 
this sphere. (3) A suggestible, dependent, shallow person, 
dramatic and theatrical in manner and labile in affect, who 
takes cues from what seems to be expected according to the 
environment and who plays a series of superficial roles in 
relating to other people. Many individuals satisfy one or 
more of these criteria for hysterical personality but do not 
have a conversion reaction. Conversely, many patients with 
conversion reaction bear none of the personality or behavior 
traits of the hysterical personality. 

A conversion voice disorder (1) exists despite normal 
structure and function of the vocal folds, (2) is created by 
anxiety, stress, depression, or interpersonal conflict, (3) has 
symbolic significance for that conflict, and (4) enables the 
patient to avoid facing the interpersonal conflict directly 
and extricates the person from the uncomfortable situation. 
The onset of the voice disorder is almost always associated 
with emotional conflict. For many years, the misconception 
abounded that the only voice sign of conversion was whis- 
pering or aphonia. However, conversion voice disorders 
come in many forms: muteness, aphonia, and less dramatic 
forms of dysphonia, such as breathiness, hoarseness, 
falsetto pitch breaks, and continuous falsetto. The multiplic- 
ity of voice signs of conversion reaction has been substanti- 
ated by Aronson, Peterson Jr., and Litin (1966), who found 
identical criteria for conversion reactions in patients whose 
problems ranged from muteness to varying degrees and 
types of dysphonia. 


Conversion Muteness 


The most extreme and incapacitating conversion voice dis- 
order is muteness or mutism, in which the patient neither 
whispers nor articulates, or may articulate without exhala- 
tion. Entering the room with notebook and pencil, they 
write their questions and answers, and although unaware 
of what they are revealing, involuntarily cough, showing 
their normal vocal fold adduction. The patient will also 
cough sharply on request without being aware of the incon- 
gruity between their normal cough and manifest inability 
to phonate. Patients who present with one form of conver- 
sion voice disorder very likely have experienced other 
forms. 

Common findings in such patients are chronic stress, pri- 
mary and secondary gain, indifference to their symptom, 
other manifestations of conversion, poor sex identification, 
suppressed anger, immaturity and dependency, neurotic life 
adjustment, and mild to moderate depression. Different as 
these individuals are from one another, a common denomi- 
nator runs through their histories: (1) a breakdown in com- 
munication with someone important to that person; (2) a 
conflict between wanting but not allowing oneself to express 
anger, fear, or remorse verbally; (3) fear or shame standing in 
the way of expressing feelings via conventional speech and 
language. A typical case follows: 


Case Study 8.6 


A 44-year-old female presented with no use of speech 
structures for communication. All her responses were writ- 
ten. Her history was one of struggle and deprivation from 
birth. She was the youngest of 10, her family being bur- 
dened by five other children from her father’s former mar- 
riage. By the time she was 20, her father then dead and 
most of the children gone, she had been saddled with the 
responsibility of caring for her aging mother and two older 
sisters. She lived a Cinderella-like existence, doing the 
household chores and being dominated by her sisters, 
who rarely allowed her out of the house. Under these cir- 
cumstances, at age 23, she had her first conversion symp- 
tom, a “paralysis” of the right arm lasting 2 weeks. Three 
years later, she became severely depressed and unable to 
work for several months. Then, at age 27, her mother, “the 
only person who ever loved me,” died. Shortly thereafter, 
she developed a “paralysis” of both legs and became an 
invalid for 1% years. Physical therapy effected a “dramatic 
cure.” Her doctors, who knew of her hostile and depend- 
ent relationship with her sisters, urged her to get out on 
her own. Six years later, at age 35, she married a 27-year- 
old man with a son from a previous marriage. She had 
made an adjustment to life. Financial worries and behavior 
problems in the son were followed by episodes of dyspho- 
nia. During the winter prior to the current interview, she 
had many episodes of intermittently phonated-whispered 
voice. Her current episode had begun 9 months before 
with a period of whispered speech lasting 7 months and 
finally giving way to muteness and writing notes. 


Conversion Aphonia 


Conversion aphonia refers to involuntary whispering despite 
a basically normal larynx. Indirect laryngoscopic examina- 
tion indicates normal or partial adduction of the vocal folds 
on vowel production or coughing. Even without laryngo- 
scopic examination, normalcy of the vocal folds can be heard 
as the patient coughs and makes other glottal sounds unas- 
sociated with speech. Similar to dysphonia associated with 
musculoskeletal tension, the larynx in conversion aphonia is 
often elevated in the neck along with the hyoid bone, and the 
entire laryngeal-hyoid sling is rigid and is difficult to move 
manually in any direction. 

Within the category of aphonia falls a considerable variety 
of whispers: pure or noiseless; harsh, sharp, or piercing; 
intermittent high-pitched squeaks and squeals; moments of 
normal voice. The sharpness of the whispering indicates that 
the intrinsic laryngeal muscles are in a state of hypercon- 
traction, even though the vocal folds are prevented from 
approximating. 

Approximately 80% of patients with conversion aphonia 
are female. Many have had previous episodes of aphonia or 
dysphonia that have spontaneously cleared. Onset can be 
sudden, within seconds or minutes, or over a period of 
hours, beginning with hoarseness that turns into aphonia. 
Conversion aphonias and dysphonias are often triggered by 
colds or flu and associated laryngitis. Upon recovery from 
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the upper respiratory infection, the dysphonia remains, 
worsening to aphonia. Often, onset is associated with 
fatigue or exhaustion. Discomfort, pain, and tightness in the 
larynx and upper and lower neck and chest regions are com- 
mon in the conversion group, as they are in the muscu- 
loskeletal tension group. As in conversion muteness, 
aphonic patients give histories including either acute or 
chronic emotional stress, symbolic significance of the voice 
loss, primary and secondary gain, a tendency toward indif- 
ference to the incapacitating effects of the voice, and con- 
version reactions in other regions of the body at different 
times in their lives. Additional findings are emotional 
immaturity, neurotic life adjustment, and mild to moderate 
depression. 

Although the following studies are of patients with con- 
version aphonia, they could apply equally to patients who 
are mute or to those who are dysphonic. 


Case Study 8.7 


A 14-year-old girl’s parents were separated and involved 
in divorce litigation. She had markedly ambivalent feel- 
ings toward her father. Her latest episode of aphonia 
occurred an hour after a violent argument with him on the 
telephone over her request for Christmas money. 


A 58-year-old housewife learned of her brother’s death. 
En route to his funeral, she received word that her sister- 
in-law had died. She awoke the next morning with apho- 
nia that continued for the next 2 years. 


A 31-year-old secretary and mother of three had been car- 
rying on an extramarital affair for 9 years and was fearful 
that this relationship would be discovered. During the 
past year, she had been having disagreements with her 
employer, and her fear of being fired was the only factor 
that kept her from “telling him off.” Her aphonia occurred 
shortly after her boyfriend lost his job and moved to anew 
location, preventing her from communicating with him; 
she had begun to consider breaking off their relationship. 


A 26-year-old police officer was not happy in his work but 
would not tell anyone of his feelings, particularly his par- 
ents, for fear of disappointing them, as they had encour- 
aged him to enter police work. He had always been quiet, 
nonaggressive, and reluctant to lose his temper. He had 
had previous episodes of aphonia and dysphonia that had 
cleared spontaneously, but this most recent one failed to 
improve over a 3- to 4-week period. The main reason for 
his dislike of being a policeman was an inability to exert 
authority—a fear that he would say the wrong thing or 
reap dislike or disapproval from those he tried to repri- 
mand. He would find himself becoming nervous and 
upset when having to give a motorist a ticket; if the 
motorist argued with him, he would shake and get mixed 
up. He had no idea that his voice losses were related to his 
attitudes toward his work until after symptomatic voice 
therapy and discussion with a psychiatrist, when he began 
to realize that his voice loss was an attempt to change 
jobs. He finally came to realize that he must tell his family 


about his feelings and return to his previous job, which 
was that of a dyer in a textile mill, where he could be alone 
with his thoughts. 


We are reminded of an important personality characteris- 
tic of patients with conversion voice disorders: their diffi- 
culty in dealing maturely and openly with feelings of anger. 
The following exchange illustrates the point: 


Case Study 8.8 


Clinician: You would say, then, that you are not very good 
at telling people how you feel. 

Patient: Certainly not. | look calm to everybody, but I’m 
not. When something bothers me | get it right here 
(motions to her throat) and | just tighten up. 

C: You’re not the type that can talk back and let off steam. 

P: (Smiles) With my voice the way it is? Why, that’s impos- 
sible. 


Here is a more detailed case study of a 33-year-old, mar- 
ried woman who went to her otolaryngologist after becoming 
aphonic. The case illustrates the sheer depth and complexity 
of human problems that can underlie voice loss, the impor- 
tance of not approaching aphonias and dysphonias as voice 
disorders only but rather approaching treatment of the 
total person, and the importance of delving into the psy- 
chosocial history to obtain the underlying etiology of the 
voice disorder. 


Case Study 8.9 


Otolaryngology: This 33-year-old woman has a history 
of recurrent sinusitis. Earlier this year, she noted sore 
throat and sinus drainage. She has been unable to speak 
normally for a week, along with difficult and painful 
swallowing, reduced appetite, tinnitus, and dizziness 
when she blows her nose. Her vocal folds are inflamed 
on examination. Impression: Severe upper respiratory 
infections with bronchitis and laryngitis. Her dysphonia 
seems out of proportion to the physical findings. She 
has a good cough. Psychogenic aphonia. See Speech 
Pathology. 

Speech Pathology: History. This 33-year-old, married 
female with two boys aged 4 and 9 years was referred 
from the Department of Otolaryngology because of sus- 
pected psychogenic aphonia. She says she lost her voice 
13 days ago. She awoke with a “scratchy” voice, which dis- 
appeared within an hour, and she has been whispering 
ever since. She had one similar voice loss for 2 days when 
she was 16 years old. 

Examination: The patient was aphonic on examination. 
However, her cough and coup de glotte were normally 
sharp. She had +2 musculoskeletal tension and discom- 
fort in the thyrohyoid region in the response to palpation. 
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Diagnostic-therapeutic musculoskeletal tension reduc- 
tion in the laryngeal-hyoid region produced completely 
normal voice within 15 minutes. 

The psychosocial history portion of the examination 
disclosed serious and chronic emotional problems, which 
the patient described with considerable crying. 


1. She has been desperately unhappy living in a small 
town where she feels completely isolated with no one 
to talk to. 


2. She feels overburdened having to care for her two chil- 
dren, the younger of which is exceptionally active and 
difficult to control. 


3. She and her husband fight a great deal over material 
possessions and about one another’s parents. They 
rarely go out together. He often leaves her alone when 
he goes hunting and fishing. 


4. She complains bitterly about her husband’s parents, 
who dislike her intensely and do not talk to her when 
they see her in public. 


5. She is estranged from her own parents, whom she 
describes as eccentric and crude. (It is important to 
note that she lost her voice on the day of an impending 
visit from her parents.) 


6. She cries most intensely while describing her sister’s 
suicide 5 years ago. She has never spoken to anyone 
about her feelings concerning her sister’s death. 


7. She is embarrassed and fearful that her husband will 
continue to criticize her as being like her sister, and she 
expresses misgivings as to whether or not she might 
have the capacity to commit the same act. 

Impression: Psychogenic aphonia, probably conversion 
disorder. 


Recommendation: (1) Normal voice followed sympto- 
matic voice therapy, as noted. (2) Discussed with patient 
and husband the importance of psychiatric consultation. 
They agreed. 

Psychiatrist: This 33-year-old, married mother of two 
sons was referred for psychiatric consultation by Speech 
Pathology. 


She described developing her difficulty with her voice 
the day she received news that her father had invited him- 
self and her mother to stay with her and her husband for an 
indefinite time while her husband went on a hunting trip. 
Throughout the initial part of the interview, the patient 
repeatedly returned to her father’s physical and verbal 
abuse and his critical and demeaning nature throughout 
her entire life. On more recent visits, he has become sexu- 
ally inappropriate toward her. She describes several other 
recent sources of life stress and perceives herself to be irri- 
table and unhappy during the past 5 years, since her 
younger sister’s death by suicide. Since then, she has been 
more argumentative and difficult to get along with. Other 
complaints are social and emotional isolation, finding it dif- 
ficult to make friends and to get babysitters for her two 
children. She has a poor relationship with her in-laws, 
whom she perceives as mean to her and judgmental and 





unsupportive of her throughout her marriage, particularly 
after the birth of her first child, who had a congenital leg 
deformity. Recently, she impulsively and angrily confronted 
them. When they responded defensively in anger, she saw 
this as confirming their lack of caring and their meanness. 

She is frustrated within her marriage, perceiving that 
her husband’s work prevents him from spending suffi- 
cient time at home, and she feels isolated, unhappy, and 
unsupported. 

Her family history is positive for suicide in her younger 
sister, who was diagnosed as schizophrenic and who 
spent considerable time in and out of psychiatric institu- 
tions. She has an alcoholic brother and paternal grandfa- 
ther, and her maternal grandfather committed suicide 
during the Depression. 

She believes that her upbringing was unhappy—that 
her father was abusive and uncaring and her mother pas- 
sive and dominated. Her father treated the girls in the 
family with disrespect, was demeaning and at times phys- 
ically abusive. He valued only his sons. She is angry with 
her mother, who did not defend her and her sisters from 
this behavior. She was particularly close to her younger 
sister when they were growing up and they served as sup- 
port and confidant for each other. In her mid teen years 
her sister became extremely troubled and got involved 
with a bad crowd at school and with drugs. Her behavior 
continued to deteriorate, and her sister was disowned by 
the rest of the family, whereas the patient repeatedly 
tried to rescue her and help her. This situation continued 
for several years until this young woman committed sui- 
cide. The patient relates having had to deal with funeral 
home arrangements and visitations and that no one else 
in the family attended or was supportive. She expresses 
feelings of guilt and a sense that she should or could have 
done more despite her repeated attempts to help, 
although her efforts were undone or rebuffed by her 
sister. 

The mental status examination disclosed the patient to 
be neatly and stylishly dressed and groomed, appearing 
younger than her chronologic age. The initial part of the 
interview consisted of a dramatic litany of many individu- 
als in her life who had or have been “treating me mean.” 
She became tearful while discussing her sister’s death and 
her guilt, remorse, and feelings of isolation. She was able 
to acknowledge an underlying sense of low self-esteem. 
At the end of the interview, she was less distraught, more 
sad, and more comfortable than at the beginning. She 
acknowledged that she needed to continue to talk with 
someone. There was no evidence of a thought disorder or 
gross cognitive impairment. Her voice remained normal 
throughout the entire interview. 


Impression: (1) Adjustment disorder with mixed emo- 
tional features. (2) Dysthymic disorder. (3) Histrionic 
personality traits. (4) Secondary marital discord. (5) Conver- 
sation aphonia (by history; voice normal on examination). 

Recommendation: This patient needs psychotherapy. | 
will look into different treatment options that would be 
available for her. 
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Conversion Dysphonia 


Varying degrees and types of hoarseness—with and without 
a strained-harsh quality, high-pitched falsetto breaks, 
breathiness, intermittent whispering with moments of 
breathy and normal voice, and other variants too numerous, 
diverse, and indescribable to mention—occur for the same 
psychodynamic reasons as muteness and aphonia. A study 
undertaken by Aronson, Peterson Jr., and Litin (1966) com- 
paring mute and aphonic patients with those having varying 
forms of dysphonia, conducted to determine similarities and 
differences in the etiology of their voice signs, showed that 
this dysphonic group was not essentially different in terms 
of case history, personality, and clinical criteria for conver- 
sion reaction from mute or aphonic patients (Table 8.1). 

An important finding among all these patients, regardless 
of voice type, is that despite the psychoneurotic explanation 
for their voice signs, few have incapacitating psychiatric dis- 
turbances. In many ways, they have adjusted to their anxiety 
or depression. Many are willing to continue as they are 
rather than submit to any therapy. Others are earnest in their 
desire for a better voice. 


Psychogenic Adductor Spasmodic Dysphonia 


As described elsewhere in this text, certain patients who 
have an intermittently strained voice, or complete voice 
arrests from hyperadduction of the vocal folds during con- 
textual speech, descriptively called adductor spastic dyspho- 
nia, have excess laryngeal musculoskeletal tension caused by 
psychologic stress. Some of these patients have conversion 
reaction, in which the adductor spastic dysphonia is pro- 
duced by the same mechanisms that produce conversion 
muteness and conversion aphonia. 

In other patients, the adductor spastic dysphonia is a sign 
of a neurologic disorder, and that is the reason why proper 
differential diagnosis between the psychogenic and neuro- 
logic types of spastic dysphonia is so important. Following is 
a case study of a young man who developed psychogenic 
adductor spastic dysphonia, which in all probability was a 
conversion reaction. 


Case Study 8.10 


History: A 29-year-old, single, male office worker was 
referred by Otolaryngology for the evaluation of dyspho- 
nia. The patient had complained for several years of an 
abnormally strained voice. However, laryngologic exami- 
nation failed to demonstrate any structural lesions or 
weakness of the vocal folds. He was referred to Speech 
Pathology for further investigation of the cause of the 
voice disorder and for therapy. 

Examination: This patient has a moderate to severe 
adductor spastic dysphonia consisting of intermittent 
waxing and waning of strained hoarseness and moments 
of complete voice arrests due to adductor laryngospasms. 
Vowel prolongation is clear some of the time but inter- 
rupted by varying degrees of strained hoarseness. The 
remainder of the oral-physical and speech examinations 
failed to disclose evidence of motor-speech disorder. 


The psychosocial history background to the voice disor- 
der is highly positive. The patient’s voice problems began 4 
years ago as whispered speech, which lasted 3 years, 
except for a 10-month interval during which his voice 
returned to normal. When the abnormal voice reappeared, 
it had the intermittently strained voice quality identical to 
his voice on examination. He has no family history of voice 
disorder or neurologic disease, including tremor. 

The patient is a college graduate who majored in busi- 
ness administration. He is unmarried and lives with his 
parents. He admits to several problems in his emotional 
and personal life. 


è He has a history of “fears” that he has difficulty 
describing. They emerge in the form of nightmares 
and sudden unpredictable terrors triggered by every- 
day incidents. For example, he developed a “great 
fear” a few years ago upon viewing a wax figure in a 
museum that made him feel as if he were “losing con- 
trol.” For this, and other reasons, the patient has been 
undergoing psychiatric treatment. 


+ The patient expresses considerable ambivalence 
about living with his parents. He enjoys the comforts 
of familiar surroundings, being waited on by his par- 
ents. At the same time, he is irritated with himself and 
with them because of his dependency. He does not 
wish to hurt his parents’ feelings, but he is chronically 
irritated by his father’s continuous, perfectionistic crit- 
icism of his habits—how he conducts his life, reminding 
him to straighten his tie and to get his hair cut, and 
refusing to allow him to use the family car. His mother, 
whom he describes as frequently irritable, badgers 
him to get married, which he would very much like to 
do, but he has always had difficulty establishing com- 
patible relationships with women. He describes his 
mother as “nagging” and his father as “intolerant.” 


The patient moved out of his parents’ home for 9 
months, during which his spastic dysphonia disappeared. 
He lived in an apartment, by himself, a few blocks from his 
parents, but he felt the need to visit them every night. He 
then moved back into his parents’ apartment, whereupon 
his spastic dysphonia promptly returned. 

During a discussion with the patient and his parents, 
they corroborated their son’s description of the family 
relationships, his father saying, “He is afraid to get out on 
his own. He is overdependent upon us.” 

The patient, for his part, and with some irritation, 
rejected the interpretation that his voice disorder had 
anything to do with his ambivalence about living at home, 
although later he reluctantly admitted that there might 
be some connection. Nevertheless, he continued to ask 
questions revealing that he was looking for an organic 
explanation for his spastic voice. 


Impression: Psychogenic adductor spastic dysphonia, 
possibly conversion disorder. His initial voice problem 
sounds like he had conversion aphonia, and now he has an 
adductor spastic dysphonia. Both the aphonia and the 
spastic dysphonia disappeared for long periods, only to 
return. Organic (neurologic) forms of spastic dysphonia 
do not remit to this extent. 


ESL 


Table 8.1 Psychiatric Characteristics of 27 Patients with “Functional” Voice Disorders 


































































































































































































Group I Group Ill Patients 
Patients Group II Patients (Intermittently Group IV Patients 
(Mute) (Continually Whispered) Whispered-Phonated) (Continually Phonated) Total 
Psychiatric Bp sh a) Set 
Factor 1 2S ee ee 6n 7 ee See | en 2 ee a249 10 11 12 13 25 26 14 #15 16 17 18 19 20 27 + = 
Acute stress H H H H H 10 17 
Chronic stress + + = + H H H ar = = + 13 14 
Conflict H H H H H H H 15 12 
symbolism 
Primary gain H H H H H 13 14 
Secondary gain qe AE H + ar ap H ar = ar = + 17 10 
Indifference H H H H H H H H 17 10 
Clinical hysteria + + + + H + a = 2 ib 
Other conversions + H H 8 19 
Somatic F Ar F H + F = = 7 20 
complaints 
MMPI conversion ap aF + H F = F 9 18 
Marital discord H H H H : H 10 17 
Poor sex de Ea Ho ap 4 da A SE IE) 
identification 
Anger + + + + + + + e O A 2 5 en 
Immature } } } } } } } } H H 18 9 
dependent 
Neurotic life H H H H H H H H H 16 11 
adjustment 
Mild to moderate H F H H a H H 7 20 
depression 


Source: Aronson, A.E., Peterson, H.W., Jr., Litin, E.M. (1966). Psychiatric symptomatology in functional dysphonia and aphonia. J Speech Hear Disord 31, 115-127. 
MMPI, Minnesota Multiphasic Personality Inventory. 
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The conflict in which this patient is trapped has to do with 
his need for his parents’ support; however, for this he has 
had to accept being cast in the role of an adolescent by his 
parents’ continuation of their lifelong overconcern and 
overcriticism of him. He is unable to separate from his par- 
ents and is chronically angry at their overcontrol, but he is 
unable to vent his anger at them for fear of disturbing them 
and upsetting the domiciliary arrangements that he enjoys. 
His spastic dysphonia may be seen as a somatic manifesta- 
tion of his inability to express his true feelings to his parents. 


Recommendations: During an extensive and frank dis- 
cussion with the patient and his parents, during which the 
probable underlying dynamics of the voice disorder were 
discussed, it was reemphasized that the voice disorder 
was not caused by organic illness and it was strongly sug- 
gested that the patient return to his psychiatrist for more 
intensive psychotherapy with the objective of helping him 
to become more self-reliant and less dependent on his 
parents for emotional and physical support. A full report 
will be sent to the patient’s psychiatrist. 


Mutational Falsetto (Puberphonia) 


The problem of mutational falsetto, or puberphonia, revolves 
around the subject of the pubertal change of the voice. What 
we are concerned with here are the physiologic changes of 
the male and female anatomy associated with pubertal and 
adolescent voice change, a subject inseparable from the dis- 
order of mutational falsetto, or puberphonia. The mutational 
period of human development represents a dramatic physi- 
cal and emotional transformation in the life of the individual. 
By way of review, we have noted in Chapter 2 that the prin- 
cipal changes that take place during puberty are (1) a con- 
siderable increase in vital capacity secondary to an increase 
in the size and strength of the thoracic musculature, (2) an 
increase in the length and width of the neck, and (3) a 
descent of the larynx producing a greater length and width 
of the pharynx thus enlarging the resonatory system. The 
length of the male vocal folds increases by approximately 
1 cm and the female ~3 to 4mm. 

The basic difference between the pubertal development of 
the male and the female larynx has to do with the direction 
of their growth. Until puberty, they are essentially the same 
in size and form; however, during pubertal development, the 
male larynx grows especially in the anteroposterior direc- 
tion leading to the protrusion of the thyroid eminence also 
known as the pomum Adami, or Adam’s apple. In general, 
the female larynx increases in height greater than in width, 
the result in both sexes being a deepening of the voice and 
greater vocal power. 

Furthermore during mutational voice change, the mucosa 
becomes stronger and less transparent; the tonsils and ade- 
noids atrophy; and the epiglottis increases in size, generally 
flattens out, and assumes a more elevated position. 

The timetable for mutation of voice begins with premuta- 
tional changes consisting of a loss of phonatory power, a slight 
lowering of pitch, and breaks in the voice. Voice mutation in 
females starts as early as 9 years of age and in males around 


10. The thymus gland diminishes and disappears. The second- 
ary sex characteristics develop, the function and size of the 
thyroid gland increases, as well as the hypophysis. In males, 
there is a considerable increase in respiratory capacity; both 
the length and circumference of the chest increase rapidly 
providing greater driving force. The above-mentioned length- 
ening of the neck and relative descent of the larynx are associ- 
ated with deep voices. As has been noted in an earlier chapter, 
there are racial, geographic, and climatologic determinants of 
the advent of mutation occurring earlier in city dwellers, in 
better educated children, and in warmer climates. Girls’ voices 
tend to change about 1 year earlier than boys’ and the change 
is less dramatic. In temperate climates, the majority of girls 
begin mutation at 12 to 14 years of age, boys 13 to 15, with 
greater acceleration nearer the Equator and some retardation 
of the onset of mutation at the North and South Poles with 
exceptions to this rule. The average duration of mutation is 3 
to 6 months. The so-called break in the voice during pubertal 
change consisting of a sudden, involuntary change in pitch 
and quality is not typical of most boys’ voices and is more a 
caricature than anything else. It is probably true that only a 
minority of young boys experiences the voice “break.” In a 
study by MacKenzie (1871), only 17% of choir boys showed 
breaks of their voices, and the phenomenon is uncommon in 
girls. In the literature, the majority of writers suggest that the 
causes of the break are sudden and alternating contractions of 
the cricothyroid muscle and the thyroarytenoid muscle. The 
pubertal voice change begins with a huskiness and unsteadi- 
ness of the speaking voice, which may show vacillations of 
one to two tones, also known as the “forme frust” of the break. 
The general trend of mutational voice change is a downward 
direction of pitch. It is generally agreed that boys’ speaking 
voices become one octave lower and girls’ one or two tones. 
Investigations of pubertal change of the singing voice have led 
to the agreement that the lower limit of girls’ voices descends 
one or two tones and the upper limit may gain the same. The 
lower limit of boys’ voices descends a full octave. The transfor- 
mation of the boys during mutation seems to take the follow- 
ing direction: a gradual lowering of the lower limit; with 
steadiness of the higher tones, the low notes slowly become 
steady (main phase); the stabilization of the low notes marks 
the end of the mutational period. 


Pathologic Mutation of Voice 


Abnormal vocal mutation can be subdivided into organic and 
psychogenic etiologies. Although the symptoms may be sim- 
ilar, the etiologies differ with the organic haring an identifi- 
able physical component. The emphasis in this chapter is on 
the nonorganic mutations. 
Classification of Mutational Disturbances 
Organic 
1. Endocrinologic 
a. Persistent child’s voice 
b. Delayed mutation 


c. Precocious mutation 
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Psychogenic 
1. Mutational falsetto 


2. Incomplete mutation 


Mutational Falsetto This is the most frequently diagnosed 
mutational disturbance. The mutational falsetto voice is con- 
spicuously high and casts doubt on the masculinity and 
maturity of the speaker. Not only caused by problems of sex- 
ual identification, mutational falsetto can result in psycho- 
logic disturbances because of its effect on others. 

The laryngoscopic finding in mutational falsetto is one of 
an elongated elliptical opening of the glottis typical of nor- 
mal speaking males. 


Incomplete Mutation This disorder consists of incomplete 
or insufficient lowering of the boy’s voice, which may be 
weak, breathy, and have a definite effeminate quality, espe- 
cially over the telephone, a curse to its user. 

Within the context of voice mutation and mutational 
voice disorders, it is reasonable and appropriate to bring up 
the history of the eunuch and the castrato. The word eunuch 
comes from the Greek eunochos and the Latin eunuchus 
meaning “keeper of the bedchamber,” one of the eunuch’s 
many functions. Men have been castrated from the time of 
antiquity for several reasons: (1) as a consequence of being 
taken prisoner during times of war; (2) for the salvation of 
the soul. In certain Christian sects, men have castrated 
themselves both testes and penis to suppress temptation; 
(3) for purposes of revenge. In the Orient, a vengeful hus- 
band could demand in court the castration of his wife’s 
lover; (4) to produce male opera sopranos, a subject upon 
which we will elaborate later. In post-Renaissance Italy, 
choirboys were castrated so that these youths and adults 
could sing as full-bodied sopranos in the cathedrals of 
Europe. Although the Catholic Church discouraged the 
operation, the Vatican’s interest in emasculated men for its 
choirs maintained the castrati into the 20th century; (5) in 
Rome, handsome men were castrated to become playthings 
or “boy toys” of wealthy matrons who could copulate with- 
out fear of pregnancy; (6) as guardians of harems. In the 
East, eunuchs served as the protectors of sultans’ harems; 
(7) as punishment for rape. Through history, castration was 
employed against rapists; for example, by the ancient Greeks, 
who used castration to punish rapists. The offender was 
called a spados meaning “to draw out” or “drag,” a descrip- 
tion of how the testes were removed from the scrotum. 
These men were despised in Greek society and refused 
employment. These men who had their testes “dragged” 
often masqueraded as women, giving rise to the use of the 
word drag for a man dressed in women’s attire; (8) as noted 
above, during the pre-Christian era, the Roman eunuch was 
fittingly called a volupatas, Latin for “pleasure,” and the ori- 
gin of the word voluptuous for the soft, rounded, feminine 
distribution of the eunuch’s body fat. Eunuchs became the 
main attraction in the orgies given by the emperors 
Octavius, Tiberius, and Caligula, who made no secret of 
their fondness for castrated men. The satirical poet Juvenal, 
speaking of Roman women who preferred the flower of 
marriage and not the fruit, wrote “some women always 


delight in soft eunuchs and tender kisses and in the 
absence of a beard and the fact that the use of abortives is 
unnecessary.” 

Eunuchs were even more popular among Roman men. The 
2nd century Greek satirist labeled homosexuality with 
eunuchs as a cause of the corruption of Roman morals by 
tempting everyone from emperors to generals. 

Castration has played a significant role in the history of 
Christianity. The Christian emperor Constantine reduced the 
number of eunuchs when he ordered capital punishment for 
castrators. 

The soprano castrato voice is the product of the following 
anatomic changes that occur as a consequence of castration. 
To begin with, during puberty in the male, the vocal folds 
increase in size as a response to the secretion of testosterone 
from the gonads, and the neck lengthens increasing the size 
of the pharynx and also increasing the thoracic dimensions. 
By removing the gonads between the ages of 7 and 10, the 
boy’s soprano never deepens to the baritone or bass, and 
with increased vital capacity and increases in the size and 
strength of his respiratory musculature, his thin soprano 
blossoms into a large expansive soprano that no female 
could produce. For centuries, the Catholic Church in Rome 
was the major user of castrated sopranos, employing them in 
the Vatican Chapel choir until the practice of castration was 
condemned by Pope Leo. One Italian writer described the 
castrati soprano as follows: 


What singing! Imagine a voice that combines the sweet- 
ness of the flute and the animated suavity of the human 
larynx—a voice that leaps and leaps, lightly and sponta- 
neously, like a lark that flies through the air and is intoxi- 
cated with its own flight; and when it seems that the voice 
has reached the loftiest peaks of altitude, it starts off again, 
leaping and leaping still with equal lightness and equal 
spontaneity, without the slightest sign of forcing or the 
faintest indication of artifice or effort; in a word a voice 
that gives the immediate idea of sentiment transmuted 
into sound, and of the ascension of a soul into the infinite 
on the wings of that sentiment. What more can I say? 


This method of castration going back to antiquity was 
practiced in the Middle East to obtain a eunuch for the 
harem or to permanently punish a rapist: the male was 
strapped, arms and legs, to a table, a thin cord was knotted 
around his genitals, and with a sharp razor, the organs were 
amputated. The wound was cauterized by the application of 
a searing hot poker or molten tar. The mutilated youngster or 
adult was deprived of water for several days to prevent uri- 
nation, which could infect the healing region. Then, he was 
forced to drink enormous amounts of water, until the pres- 
sure in his bladder literally pushed the urine through layers 
and layers of scar tissue, providing him with an orifice. The 
mortality rate was perhaps 80%. If he survived the operation, 
he began his new job as a docile, sexually unthreatening 
slave. 

Clearly, the castrato voice is not the typical mutational 
falsetto. Mutational falsetto is not due to any anatomic 
immaturity of the larynx or vocal folds. The larynx is 
anatomically and physiologically capable of producing a nor- 
mal low-pitched voice but fails to do so. In males, the voice 
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may move upward in pitch during puberty, giving the 
impression of a female voice. This high-pitched falsetto type 
is weak, thin, breathy, hoarse, and monopitched, giving the 
overall impression of immaturity, effeminacy, and passive- 
ness. In some males and females, the change in voice pitch 
may have begun but failed to descend completely, stopping 
somewhere between the pubescent and the adolescent 
voice. The pitch is lower than the falsetto type but retains a 
similar weakness, thinness, hoarseness, and monopitch. 

The etiology of mutational falsetto is unclear because it 
has not been carefully investigated. Some attribute it to 
faulty learning and others to psychogenic causes. The indi- 
vidual and family background cannot be depicted other than 
superficially. One opinion is that the pubescent or adoles- 
cent, if male, acquired a stronger feminine than masculine 
attachment and self-identification and a neurotic need to 
resist the normal transition into adulthood. Variations along 
these themes include embarrassment about an excessively 
low-pitched voice developing earlier than the patient’s 
contemporaries, forcing the pubescent male to retain the 
high-pitched voice, or the unconscious or conscious need to 
maintain a higher pitched singing voice because of the 
rewards attendant upon that skill such as that associated 
with participation in boy choirs. 

Psychologic immaturity may not be the only cause of this 
disorder. Other factors are (1) delayed maturation in 
endocrine disorders that retards laryngeal development, per- 
petuating a high-pitched voice that then becomes difficult to 
abandon because of its longer-than-normal persistence into 
adolescence, even after the larynx has attained normal size; 
(2) severe hearing loss preventing the individual from per- 
ceiving his or her voice during adolescent voice change; (3) 
weakness or incoordination of the vocal folds or of respira- 
tion because of neurologic disease during puberty; (4) gen- 
eral debilitating illness during puberty, which not only may 
delay overall growth during puberty but, because of the phys- 
ical restrictions of being bedridden, may reduce the range of 
respiratory excursions and, consequently, tidal air volumes, 
preventing the development of adequate infraglottal air pres- 
sure necessary for full vocal fold displacement. 

The following laryngeal-respiratory postures and move- 
ments are bases for the high-pitched mutational falsetto 
voice (Fig. 8.1). 


+ The larynx is elevated high in the neck. 
+ The body of the larynx is tilted downward. 


+ The vocal folds are stretched thin by contraction of the 
cricothyroid muscles. 


Respiration for speech production is shallow, and on 
exhalation infraglottal air pressure is held to a minimum, 
so that only the medial edges of the vocal folds vibrate 
and do so at an elevated fundamental frequency. 


Mutational falsetto is found in patients of all ages, some as 
young as 14 or 15 years and brought by their parents, and oth- 
ers volunteering themselves for help later in life after a long 
struggle with this social and psychologic handicap of a voice 
that does not match gender or sounds infantile. Almost all 
those seeking help for the first (or 10th) time in their 50s or 
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Figure 8.1 Comparison of normal laryngeal position with that of 
mutational falsetto. (A) Normal phonation. Note relatively low position 
of larynx, ample thyrohyoid space, blunt edges of vocal folds, and high 
infraglottal pressure. (B) Mutational falsetto. Note relatively high posi- 
tion of larynx, narrowed thyrohyoid space, thin edges of vocal folds, 
and low infraglottal pressure. 


60s testify that their voices have broken into deeper or lower 
pitch levels during shouting or lifting. Characteristically, they 
fail to recognize the message of normalcy that these down- 
ward pitch breaks reveal. The falsetto voice is the only one 
they know. Producing a sharp glottal attack with or without 
the examiner exerting downward force on the larynx during 
phonation elicits the low-pitched voice. The psychologic 
penalties of mutational falsetto are great, labeling the male 
bearer as feminine. Use of the telephone is particularly 
embarrassing, as callers misinterpret the male with falsetto 
as being female. Teasing of young males in school and rejec- 
tion for employment are additional problems. 

Clinicians often miss the diagnosis of mutational falsetto 
because of unfamiliarity with the syndrome. Yet it is one of 
the easiest and most rewarding disorders to treat. Familiarity 
with the distinctive sound of mutational falsetto and the 
method of testing for it should be mandatory in the training 
of all speech pathologists and laryngologists. 
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Transsexualism Changing one’s anatomy to assume the 
opposite sexual role is called transsexualism. The majority of 
such changes are from male to female. Such individuals 
need, in addition to an altered physical appearance, speech 
that conforms to the chosen sexual identity. The effects on 
the larynx of the administration of female hormones for the 
enhancement of female characteristics are apparently mini- 
mal. The successful elevation of fundamental frequency in 
transsexuals as a consequence of voice therapy has been 
reported by Bralley, Bull, Gore, and Edgerton (1978). These 
researchers were able to elevate fundamental frequency in 
one patient from 145 to 165 Hz after seven therapy sessions. 
Kalra (1977) was able to change the fundamental frequency 
of a male-to-female transsexual from a mean of 168 to 196 Hz 
after four therapy sessions. These patients reported satisfac- 
tion with their increased fundamental frequencies, although 
as it turned out, voice pitch was not the only method of 
effecting changes: altered phoneme production, inflectional 
patterns, and even vocabulary contributed to the total 
impression of femininity. 


Childlike Speech in Adults Adults, adolescents, and chil- 
dren are seen in clinical practice who present with childlike 
speech patterns. The effect of a mature person speaking like 
a child is created by a combination of phonatory, resonatory, 
and articulatory modifications. However, in addition to the 
speech patterns that simulate those of a child, the impres- 
sion of immaturity is often augmented by facial expressions, 
gestures, and postures that convey dependency and passive- 
ness. The aberrant speech patterns usually consist of (1) ele- 
vation of habitual pitch; (2) exaggerated and ingratiating 
inflectional patterns; (3) articulation with reduced mouth 
opening; (4) open facial expression with raising of eyebrows; 
(5) cherubic smiling; and (6) demure head movements, bod- 
ily postures, and gestures. 

Regressive speech serves the purpose of relieving the per- 
son from the responsibility of relating to others on an adult 
plane. It says, in effect, that the person does not wish to be 
regarded as an adult with the responsibilities for mutual 
interaction that an adult relationship entails. The following 
case study illustrates one variation on the theme of childlike 
speech in adults. 


Case Study 8.11 


Speech Pathology: This is a 26-year-old, divorced woman 
with two young children who is a student in a junior col- 
lege. She was referred by the school counselor, who was 
concerned about the patient’s speech because of its 
implications for her future ability to succeed academi- 
cally, socially, and vocationally. She feels self-conscious 
about her speech and is afraid to start conversations. 

On examination, she had a high-pitched voice pro- 
duced with childlike inflectional patterns. She articulated 
with small, smiling mouth postures and spoke with her 
head cocked to one side, using supplicating, ingratiating 
gestures as she did so. 

Prior otolaryngologic examination established that her 
larynx was of normal size. On phonatory examination, she 


was able to lower her pitch considerably when instructed 
to do so but was unable to sustain a low-pitched voice 
during conversational speech. 

Symptomatic voice therapy was administered during 
seven voice therapy sessions over a period of 1 month 
during which the patient was able to lower her pitch and 
eliminate many of her childlike mannerisms. Having 
accomplished this change in her external means of com- 
munication, she began to express the need for greater 
independence and complained that she found it difficult 
to use the more adult speech patterns that she had 
learned in social relationships because of the persistence 
of her childlike and dependent self-image. 

Psychiatric consultation with the possibility of psy- 
chotherapy was recommended, to which the patient 
acquiesced somewhat reluctantly as she realized that 
changing her speech alone would not solve all of her inter- 
personal and self-image problems. A psychiatric consulta- 
tion was arranged. 

Psychiatry: This patient was seen in psychiatric consulta- 
tion because of her unusual voice and speech manner- 
isms, which would appear to be associated with her basic 
personality and self-concept. Her entire demeanor, 
including her gestures, posture, manner of speech, and 
tone of voice, come across as a very small, dependent, 
and naive child. 

Her history suggests that she has always been a very 
passive and dependent person. As she talks about her cur- 
rent situation and past problems, she does not give any 
indication that she recognizes that she may have played a 
part in them. All of the difficulties in her life, as she sees 
them, are due to unavoidable circumstances or to “not 
nice” people. 

She, herself, does not actually complain about her 
voice. She indicates that a counselor who is working with 
her said that it would be better if she had “a more 
assertive voice.” She says that some people act like they 
think she has a nice voice, whereas others seem surprised 
by it. In her very childlike way, she indicates that she 
always has been very shy with people and that she doesn’t 
speak up very often. She says that she can talk to “nice 
people” but doesn’t like to talk to people who are not 
nice, saying that “some people are rude.” These terms 
nice and not nice seem to be almost her only adjectives in 
describing others. 

The patient never liked school, didn’t see much point to 
it, and didn’t see why she should have to go to school. She 
said that “a lot of the kids were not nice and some were 
rude.” She had considerable difficulty academically. 

Regarding her voice, she comments that before she 
was 13 years old, her voice was both lower in pitch and 
louder, but, even then, she said she was shy and would 
rarely speak up. She rather passively accepted others’ 
opinions that maybe her voice is a problem. 

It is my impression that this patient’s voice is simply the 
most obvious component of the overall manner in which 
she presents herself. Her speech is the external sign of a 
hysterical personality structure in which she envisions 
herself in the role of a tiny, passive, dependent child who 
almost, but not quite, speaks “baby talk” and who is upset 
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by people who are “not nice,” people who might chal- 
lenge her playing of this role. At this time, she does not 
express any insight or concern about her personality 
problems or her own pattern of behaving or relating. 

Psychotherapy summary: Throughout several months of 
weekly therapy, this patient continued to speak in a very 
high-pitched, infantile voice. There were moments when 
this type of speech would improve throughout the hour, 
but it was quite unusual for this to happen. During initial 
therapy sessions, the patient presented herself as some- 
one who is very rigid and moralistic. She would come reg- 
ularly to therapy; however, she became upset when a ses- 
sion had to be canceled and refused to come or call. She 
seemed to handle very poorly any absences or fluctua- 
tions on the part of the psychiatrist. She seemed relatively 
uninvested in the therapy and saw to it that the sessions 
remained superficial. She was quite resistant toward 
efforts to enable her to gain insight into her interpersonal 
problems and into the deeper issues causing her childlike 
demeanor. Any expression of painful affect was usually 
met by fleeing from the therapy session. She was 
extremely sensitive to any change in schedule, which was 
followed by weeks of nonattendance. Finally, she failed to 
return even after contact with her by telephone. She does 
not seem capable of handling insight-oriented psy- 
chotherapy and does not seem to have the ego strength 
to tolerate a transference relationship. 


+ Rebound Psychologic Effects 
of Voice Disorders 


To possess a voice disorder, whether psychogenic or organic, 
is to experience demoralization. Voice is part of the person. 
Self-alienation occurs when our voices are no longer “us” and 
identify is lost. Once the patient learns that the voice disorder 
is not a threat to health, an adverse secondary psychologic 
reaction sets in triggered by the patient’s rejection of his or 
her own voice and by the reactions of listeners. “What is the 
matter with your voice? You should do something about your 
voice!” Change in facial expression of listeners upon hearing 
the abnormal voice is detected easily by the patient. It is only 
a brief time from the onset of the disorder until the patient 
begins to dread meeting others, acquaintances and strangers 
alike. The individual with a defective voice elects to speak less 
and less often to avoid such reactions. We observe the 
depressing effects of voice disorders in well-adjusted 
patients with organic voice disorders as well as in those 
whose disorders are psychogenic. In addition to their primary 
psychologic discomfort, patients with psychogenic voice dis- 
orders have amplification of their already unhappy state 
because of self and listener rejection of the voice. 

If the patient’s voice disorder is severe enough to interfere 
with intelligibility, the situation can be even worse. It is dif- 
ficult to understand these individuals in noisy and even quiet 
environments, so that their voice disorders seriously inter- 


fere with their professional, business, and social lives. Conse- 
quently, the effect of the voice disorder on others and on the 
patient becomes circular and escalating: the patient’s frus- 
tration and anger elevate musculoskeletal tension, and 
whatever primary organic or psychogenic voice disorder 
existed up to that point is worsened to an even greater 
degree of severity. It is axiomatic that emotional stress exac- 
erbates virtually all voice disorders, whether organic or psy- 
chogenic. Those who are privileged to care for individuals 
who have moderate to severe voice disorders will become 
convinced that the anger, frustration, anxiety, depression, 
and discouragement that these patients experience from the 
social effects of the disorder itself is tantamount to an illness 
requiring as much attention as the voice per se. 


+ latrogenic Voice Rest 


In medicine, the term iatrogenic means a disorder or illness 
caused or exacerbated by the treatment or advice adminis- 
tered by the practitioner (e.g., a vocal fold paralysis caused 
by accidental severance of a recurrent laryngeal nerve dur- 
ing a thyroidectomy). Voice rest can produce or aggravate a 
dysphonia as in the following narrative of a patient: 


They told me not to talk too much. I feel, if I’m all right, why 
are they asking me to keep quiet? Now that they told me 
not to talk, I’m worried. Never to scream? Never to sing? 
They also told me not to cough. I’ve been holding my voice 
back. The aggravation is sitting right here (points to neck 
and chest). I can’t let it out. 


Any illness induced by the actions of the clinician is called 
iatrogenic. Voice disorders can be caused or worsened by the 
actions of the clinician. The patient who undergoes surgery 
on the vocal folds or on those regions of the neck or chest 
that may result in edema or paralysis of the vocal folds may 
experience dysphonia, either temporarily or permanently. 
What cannot be predicted in such cases is the extent of the 
patient’s suggestibility upon recovery from surgery. Upon 
hearing the abnormal voice, the patient may become anx- 
ious or depressed and may develop a musculoskeletal ten- 
sion or even a conversion voice disorder superimposed upon 
the organic one. Some patients upon hearing their normal 
voices resort to older laryngeal postures to match their pre- 
operative voice, a voice that will be maintained unless there 
is behavioral intervention to teach ways of using the revised 
anatomic structure. In others, psychogenic voice disorders 
after laryngeal surgery have occurred because of uncon- 
scious hostility toward the surgeon or for secondary gain. 

A second type of psychogenic iatrogenic voice disorder can 
begin with the sometimes ill-advised prescription of excessive 
voice rest. Korvin (2005) summarized the legends and realities 
surrounding recommendations for voice rest. She divides voice 
rest into three categories: absolute with no vocalization what- 
soever, modified with soft vocalizations limited to no more 
than 5 minutes/hour, and voice conservation. The underlying 
rationale is based on theories of microtrauma and injury that 
suggest continued phonation may interfere with restoration of 
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the molecular structure of the vocal fold and healing after 
some types of surgery. Although opinions vary, studies indicate 
1 to 2 weeks is appropriate for majority of remucosalization to 
occur without any apparent muscle atrophy. Voice rest for 
longer periods of time may have adverse affects. Korvin sug- 
gests the patients at greatest postoperative risk are those who 
smoke, who have large amounts of tissue resection, and those 
who use their voice excessively. She also notes that voice rest 
has psychologic implications and that although the impact of 
voice rest on the psyche varies by patient, it may induce feel- 
ings of insecurity, helplessness, and dissociation from the ver- 
bal world. She concludes that although further research is 
needed, and voice rest should be determined on an individual 
basis related to type and degree of surgery, voice rest is both 
possible and necessary for some surgical cases. 

An alternative opinion about this controversial issue of 
voice rest is that with the exception of a few surgical proce- 
dures requiring voice rest (and even then only brief rest is 
required), to advise patients with organic, and particularly 
psychogenic, voice disorders to whisper or remain mute for 
days or weeks is the worst advice that can be given a patient 
with a voice disorder. The larynx and voice are exceptionally 
suggestible and responsive to anxiety. When a clinician sug- 
gests voice rest, a seed of doubt is planted about that 
patient’s basic ability to phonate. The possible result is a sec- 
ondary psychogenic voice disorder. An added complication 
of voice rest is that failure to use the vocal folds causes flac- 
cidity of nonuse, possibly creating still another dysphonia in 
addition to that already present. How the idea of voice rest 
originated as a “therapy” is not clear. One suspects that it is 
the choice of clinicians who, knowing little about the care of 
the patient with a voice disorder, reason by analogy that 
inasmuch as rest is good for many things, it ought to be good 
for the voice, too. The following testimony is from a speech 
clinician who herself was placed on voice rest. 


Case Study 8.12 


It all seemed so simple, a nodule on the right vocal band 
would be removed under general anesthesia and 1 or 2 
weeks of vocal rest would be advised. The Christmas sea- 
son appeared the quietest time in terms of diminished 
teaching responsibilities with the hope that the hoarse- 
ness | exhibited for several months would be diminished. 

Optimistically, | entered the hospital situation, even 
having them obligingly do the blood work and allow me to 
go “out to dinner.” Lovely hotel, food lacked style! The sur- 
gery was uneventful, occurring early in the morning and | 
returned home that evening, drowsy but well. Flowers 


phone calls, make calls, and generally accomplish those 
things that require voice. This, in itself, would have been 
no problem. What | didn’t plan on was the feeling of iso- 
lation. The ungodly sense of being a spectator to the 
world! We read about this psychologic phenomenon; we 
talk about it when we teach. Until one experiences the 
detachment and the profound loneliness, one cannot 
readily understand what the words vocal silence really 
mean. 

Few people enjoy talking to someone who can give only 
a “yes” or “no” return. Cryptic notes, written on magic 
slate, soon lose their readability, as well as their interest. 
My husband as well as my colleagues appeared annoyed 
at the frequent taps on their shoulders or the snapping of 
my fingers: two methods | used to focus attention on my 
slate. 

As an individual who has worked long years at creating 
dialogue in my household, my family showed deep 
resentment toward the monologue situation. Mother 
could not talk about her feelings. Her body was fine, why 
didn’t she talk? My disability became almost a personal 
affront and inconvenience. Even my dearest friends soon 
tired of calling and waiting for phone taps: one equaled 
yes, two equaled no, and multiple taps meant goodbye. 
The burden of conversation on the part of the speaker 
became more problematic than anticipated. How difficult 
it must be for the parent who receives little or no response 
from their child! 

Peoples’ responses generally ranged from humor to a 
singing or gestural response that assumed that | could not 
hear or speak. One bright-eyed stranger asked me if | were 
a mute. 

The frustration of not being in control was devastating. 
One constantly needs someone else, and locked within 
you are all the reactions and impressions that for years 
you have learned to share and enjoy. The very antithesis of 
one’s basic personality was operating. 

As | write this | am still silent, eagerly awaiting the end 
of this journey. My need to write this is based on my need 
to communicate and to assist those of us who work with 
aphasia, vocal problems, and other speech and language 
problems, to attempt to gain some insight into the psy- 
chologic and emotional aspects of the problems involved. 
My entire approach to therapy has altered as a result of 
this experience. Never again will | read, hear, or advise 
“vocal rest” with the same lack of understanding or empa- 
thy (Fiedler, 1977). 
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arrived and solicitous calls came constantly. It all seemed 
a lark! | was perfectly in control of the situation. Clinically, 
it would be an interesting situation for someone in the 
field of speech pathology. What followed was a nightmare 
of psychologic trauma. 

As director of a large speech and language center, it 
seemed only reasonable that | could return to work 
immediately. My family, who at first seemed delighted 
with a silent mother and wife, quickly began to realize 
that this same person would need them to answer her 


The following study illustrates a repeated finding in the 
practice of voice disorders: when patients become dyspho- 
nic or aphonic, either unconsciously or voluntarily, as in 
vocal rest, they seem to lose their sense or feel for volitional 
phonation. It is an experience that teaches the clinician that 
the patient has undergone some sort of loss of recall, mem- 
ory, or even praxis for normal voice production. We do not 
think that the articulatory mechanism is susceptible in a 
similar way to the effects of nonuse, illustrating again the 
special nature of the larynx and phonation. 
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Case Study 8.13 


D.S., a 5-year-old boy, was found to have small bilateral 
vocal nodules. His speech clinician placed him on com- 
plete voice rest, which unfortunately was enforced for 5 
continuous months. At the end of 5 months, the nodules 
had disappeared, and the child was instructed by both the 
physician and speech pathologist to renew normal phona- 
tion. Despite all efforts by the child, he could only whisper. 
He became completely aphonic but conversed easily with 
all people with much animation and relative comfort. This 
functional aphonia remained for 2 months after he was 
instructed, “Go back and talk the normal way, Davey.” The 
child gestured that he wanted to use his voice but that he 
could not “find it.” Therapy efforts to restore the phona- 
tion were begun some 7 months after the child’s natural 
phonation had ceased (Boone, 1977). 


¢ Summary 


+ Voice may provide clues to underlying personality in nor- 
mal individuals; however, research to prove this assump- 
tion is fraught with problems of defining personality, 
vocal parameters, and acceptable research methodology. 


Specific changes in voice quality, pitch, and loudness have 
been associated with discrete emotional states, environ- 
mental stress, depression, and schizophrenia. 


 Psychogenic voice disorders are muteness, aphonia, and 
dysphonia in the absence of organic laryngeal disease, or 
with disease that is insufficient to explain the voice disor- 
der. These voice disorders can be traced to the anxiety or 
depression produced by life stress, to psychoneuroses, or 
to personality disorders. 


e Musculoskeletal tension due to environmental stress can 
produce a variety of dysphonias, sometimes even apho- 
nia, that are the effects of hypercontraction of the extrin- 
sic and intrinsic laryngeal musculature. 


+ If hypercontraction of the musculature caused by muscu- 
loskeletal tension is combined with vocal abuse, the 
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Chapter 9 


Psychodiagnostic Interviewing and Counseling 


for Voice Disorders 


A quiet hour spent in discussing the problems of the patient 
will help much to remove anxieties and restore vocal func- 
tion. 


—Brodnitz 


A few introductory remarks are in order on the subject of our 
own voices as clinicians, interviewers, and therapists. Our 
speech is a medium by which we carry on our work. In the 
diagnostic and therapeutic situation, all is mediated through 
speech. Virtually all of what we do involves two or more peo- 
ple talking together where voice is a reflection of the emo- 
tional state of the clinician as well as of the patient. It allows 
both to achieve an empathic understanding. 

The quality of the clinician’s voice influences the emo- 
tional state of the listener. Many psychoanalytic writers have 
mentioned the voice not only as a means of conveying 
thought but also as an exquisite representation of emotion. 
The patient knows the clinician’s inner state from various 
nonverbal cues, including quality of voice. The tone of the 
voice and the overall prosody of speech reflect conflict in the 
patient and alert the clinician as to the best times for inter- 
vention. 

It becomes readily apparent that voice is the primary 
expression of the individual, revealing general personality as 
well as transient emotional states. An experienced psy- 
chotherapist wrote: 


Since I have been listening to my patients’ voices I have also 
noted that changes of vocal quality accompany their 
progress. One patient achieved a lighter, less ponderous 
voice as he became more comfortable with his emotions. 
Another slowed down his rapid pace. A third achieved a 
less grating and more fluid tone. In my work I will inten- 
tionally use my voice along with my words. Sometimes I 
attempt through vocal tones to soothe an anxious agitated 
patient. Other times I use my voice to stimulate a depressed 
and hopeless one. On still other occasions I talk to give the 
patient a human response and my words are less impor- 


tant than the vocal indication of my presence. Sometimes I 
remain silent in order for me to encourage separation from 
me. Occasionally my voice backfires on me, as when a 
patient notices my anger and my anxiety through the 
sound of my voice. 


It may actually feel to the patient as though he is ejecting 
an angry or sad thought from his body. Perhaps our patients 
own their spoken thoughts not only with their ears, but with 
their entire body, which both produce sound and hear it. We 
all know instances when a patient gives evidence of strong 
physical distress while verbalizing a disturbing thought and 
then feels much more relief afterwards. We all have that 
same experience which addresses itself to the relief and 
therapeutic value of catharsis; perhaps my patient’s com- 
ment that talking gives her thoughts air is not only a 
metaphor for a psychological experience, but also a descrip- 
tion of the stream of air that flows through the vocal appara- 
tus in order to produce a sound. Sound is something concrete 
touching us. Sound is a contact experience. Sound waves are 
transmitted through the air and create tiny, yet definite 
impressions on the skin and eardrum. A loud noise is felt as 
well as heard at lower frequencies. There is a gentle, but def- 
inite, vibration distinct from any superimposed on the 
sound. The voice of the therapist creates both a psychological 
and physical bridge between herself and her patient. And we 
stress the importance of the way we touch one another with 
the sounds of our voices. 

If we consider it possible that one person can physically 
affect another, then vocal intonations of the therapist take on 
further importance. The therapist’s voice is to soothe or 
stimulate the patient, a voice that reflects the relaxation and 
the self-confidence of the therapist. A self-assured voice may 
stimulate hope and courage into the timid voice of the 
patient. Our voices along with our words help our patients 
find within themselves the capacity for change (Bady, 1985). 

Let us consider the everyday functions of the human voice 
in our lives so that we can better understand why a voice dis- 
order can have such devastating effects on us. We have 
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already learned that emotional problems cause abnormal 
voice and that abnormal voice can cause or exacerbate emo- 
tional problems. 

Following are the values that voice has for us: (1) for the 
expression of intellect; (2) as a means of emotional expres- 
sion of anger, grief, and nuances of feeling and thought; (3) 
as a means of establishing and maintaining our personal 
identity. We have seen time and again how abnormal voice 
or loss of voice is tantamount to the loss of the self: “my voice 
is not me anymore” or its equivalent is so often conveyed to 
the clinician to indicate the patient's loss of personal iden- 
tity; (4) as a means of self-assertion and of controlling others; 
(5) to assert our opinions, to project our feelings, to confront 
the world; (6) as a means of expressing our creativity in the 
formulation of ideas and as an expression of our artistic 
selves; (7) to make sound for its own sake as an escape from 
loneliness and isolation (i.e., as a defense against silence); (8) 
as an expression of our sexuality during courtship and mat- 
ing; and (9) as a way of camouflaging our true feelings dur- 
ing interpersonal communication (i.e., to deceive, disguise, 
and guard our secrets). 

You lose all of these human rights and values when you 
can no longer use your voice (this truth is nowhere better 
demonstrated than in the DVD, accompanying this text, of 
Mrs. M.K. who had relied on her voice for most of life’s 
pleasures until she developed adductor spasmodic dyspho- 
nia, which nearly destroyed her). 


+ Rationale for the Psychodiagnostic 
Interview 


During the question-and-answer period after the presenta- 
tion of a paper on voice therapy for transsexualism at a 
national convention, the speaker had emphasized how, in 
addition to changing the patient’s pitch, inflection, and 
vocabulary, she found it necessary to provide psychological 
support for the identity crises that vexed her patient, who 
was going through a profound life change. A young student’s 
hand tentatively rose, and with a voice and frowning expres- 
sion that clearly communicated her perplexity and confusion, 
she hesitantly asked, “But... are... we... psychologists?” In 
that single question dwelt all the ambiguity, ambivalence, 
and uncertainty that has characterized our profession’s 
stance on the question of the responsibility for learning, 
knowing, and practicing psychologic interviewing and coun- 
seling with patients who have communicative disorders. 
The predominant attitude in our academic training toward 
this subject has been to avoid it, narrowly defining the clini- 
cian’s field of practice as retraining vocal habit patterns. We 
believe it is impossible and unethical to treat people who 
have voice disorders without weaving into diagnosis and 
therapy the interviewing skills of the psychiatrist, clinical 
psychologist, and psychiatric social worker. Any in-depth 
study of voice disorders forces us to conclude that—so long as 
clinicians obtain privileged information from patients; so 
long as people have voice problems because of life stress and 
interpersonal conflict; so long as voice disorders produce 


anxiety, depression, embarrassment, and self-consciousness; 
so long as patients need a sympathetic person with whom 
they can talk about their distress—we will need to consider 
our training incomplete until we have learned the basic skills 
of psychologic interviewing and counseling. Most psy- 
chogenic voice disorders, particularly of the conversion type, 
are unlikely to improve without a proper balance between 
direct voice therapy and psychotherapeutic discussion. 
As a psychiatrist, Kolb (1971) wrote: 


Today the interview is used by members of all the profes- 
sions directly concerned with sustaining or improving 
mental health. All are confronted with the abnormal from 
time to time. All elicit information concerning personality 
traits and symptoms that are distressing to the patient. All, 
too, must conduct themselves in such a way as to establish 
the compassionate understanding of the problem revealed 
to them by the person being interviewed. 


Speech-language pathologists who treat patients with 
voice disorders require interviewing and counseling skills to 
aid them in determining the presence of emotional factors 
and the extent to which they are instrumental in causing or 
perpetuating voice disorders. They must recognize patients 
who are in need of professional psychodiagnostic and psy- 
chotherapeutic help when their emotional problems over- 
shadow their voice disorders or when the emotional aspect 
needs to be managed before the voice disorder can be 
treated. If the speech-language pathologist has neither the 
skills to bring out the voice patient’s troubles nor enough 
knowledge to recognize serious emotional illness, a disser- 
vice is done to the patient. Stated here clearly and unequivo- 
cally, we are not advising that speech-language pathologists 
conduct psychotherapy but rather that they obtain back- 
ground information to identify etiology and to relieve acute 
psychologic distress by providing psychologic support. Its 
purpose is the etiologic differential diagnosis of the voice 
disorder and the identification and referral of voice patients 
to psychiatrists, psychologists, and other mental health 
specialists. 

In a study by Butcher, Elias, Raven, et al (1987), an attempt 
was made to determine the value of the speech-language 
pathologist functioning as a psychotherapist in collaboration 
with the clinical psychologist for patients who were unable to 
respond to direct voice therapy alone. The authors concluded 
that psychotherapy as a treatment for psychogenic voice disor- 
ders was able to achieve a 50% success rate in patients for 
whom voice therapy alone had been unsuccessful, and that a 
psychotherapeutic model for patients with psychogenic voice 
disorders with the speech-language pathologist and the psy- 
chologist working as cotherapists was highly workable in 
treating psychogenic voice disorders that had not responded to 
traditional approaches. They discovered that speech patholo- 
gists using psychotherapeutic techniques with patients who 
had psychogenic voice disorders achieved in their patients 
improved voice and positive psychologic changes. 

After discovering a considerable divergence of training, 
attitudes, and practice in the treatment of patients with psy- 
chogenic voice disorders in the United Kingdom, Elias, Raven, 
Butcher, et al (1989) surveyed 244 practicing speech clini- 
cians to determine their current opinions and practices in the 


9 Psychodiagnostic Interviewing and Counseling 195 





diagnosis and treatment of psychogenic voice disorders and 
whether or not they thought they were properly educated to 
deal effectively with this population. In their survey, they 
paid particular attention to preparation in psychology and 
psychiatry and their experiences in relating to these profes- 
sions within the framework of voice disorders. What they 
found was that: 


+ About 75% of speech clinicians surveyed said they always 
treated psychogenic dysphonia or aphonia when patients 
were referred to them. 


+ Asked how confident they were in their understanding 
and treatment of these disorders, 4% said they always 
were, 58% that they usually were, 33% said some of the 
time, and 5% said never. 


+ About 75% of the clinicians said that their patients usually 
improved. 


+ A variety of treatment techniques were used by the 
speech-language pathologist population surveyed, the 
main ones being relaxation, voice exercises, and counsel- 
ing. A minority of clinicians used a diversity of psycho- 
logic techniques. 


+ The clinicians expressed the opinion that they could 
expect more improvement from psychogenic dysphonic 
patients than from aphonic ones. 


+ Although voice exercises, breathing, and relaxation meth- 
ods were taught at the undergraduate level, psychologic 
techniques were more likely to have been learned subse- 
quently. The observation that counseling was much more 
freely advocated than taught corresponded with com- 
ments that it was important and not as extensively taught 
as needed. About 70% would still welcome further train- 
ing in the treatment of voice disorders from a clinically 
involved psychologist or psychiatrist, and several respon- 
dents believed that more graduate courses ought to be 
available than there are at present. 


è More than 75% of the respondents considered their 
undergraduate course in psychology and psychiatry as 
“mainly theoretical.” 


+ Only 15% rated their psychology lectures as related to 
their voice disorders lectures; the remainder were 
equally divided between those who perceived little rela- 
tionship and none at all. 


Referral of patients with psychogenic voice disorders to a 
psychologist or psychiatrist was practiced by 74% of the 
respondents, of whom 42% felt uncertain as to when to 
refer. More than 75% would have liked better access to a 
psychologist or psychiatrist but indicated that a barrier to 
cooperation with these professions was the mental health 
professionals’ limited interest in voice disorders; in our 
experience, lack of knowledge of the field of voice disorders. 


Based on their survey, the authors concluded that there is 
no reason to dispute the textbook advice that patients with 
psychogenic aphonia and dysphonia should be treated by 
speech-language pathologists and that improvement usually 
follows. They expressed their support for the concept that 


therapy for psychogenic voice disorders should not be 
addressed solely to the voice itself, quoting Greene (1972): 


Recovery of voice, however, is not the sole aim of treatment. 
Therapy must aim at removing or alleviating the cause and 
obtaining better adjustment of the patient to his difficulties 
by gaining some insight into the connection between the 
voice symptom and the precipitating factors. 


In their discussion of the results of this survey, the authors 
concluded: 


It is, therefore, important that any speech therapist’ treat- 
ing psychogenic voice disorders, which are not uncommon, 
should have psychological skills, and a clear idea when 
cooperation with, or referral to, a different specialist would 
be the best course. In this context, it is disappointing that 
education in psychology and psychiatry is perceived as 
remote from everyday practice. Moreover, many speech 
therapists have found it necessary to acquire important 
psychological skills after qualification. 

The education of speech therapists should always incor- 
porate counseling and other relevant psychological skills. 
These are already taught in some centres. This would pro- 
vide a welcome application of the theory and an appropri- 
ate preparation for current practice. Speech therapists 
should be prepared before qualification for undertaking 
joint therapy with a psychologist or psychiatrist, since this 
is not a rare undertaking when there is an interested per- 
son available. Post-qualification courses are essential to 
enable therapists already in practice to acquire the psycho- 
logical skills they have found they require. 


It is difficult to convince most people how deeply voice 
disorders strike at the heart of the patient’s total existence 
and how special voice is in the average person’s emotional 
and intellectual life. Always impressive is how chronic voice 
disorders can unhinge even the most stable personality. The 
following letter was written by a college professor who had 
undergone recurrent laryngeal nerve resection for adductor 
spasmodic dysphonia. She was having intermittent difficul- 
ties during the postsurgical voice-stabilization period and 
wrote the following letter to a fellow patient with similar 
voice-adjustment problems. The letter is illustrative of not 
only the impact of the voice disorder on the patient but also 
of the critical nature of the support from her speech-lan- 
guage pathologist. 


Case Study 9.1 


| was reliving anight ~15 months after my surgery when | 
read of your recent frustrations. | had been getting along 
beautifully off and on but then | had about 4 or 5 days in 
succession when my voice sounded like a scratchy whis- 
per. | just knew that the worst possible was happening— 
that my remaining vocal cord was giving out and that | 
would become voiceless. | was actually crying one night at 
9:00 pm, filled with fear, and the phone rang. My husband 


lin the United Kingdom, speech therapist is the official term for 
speech-language pathologist. 
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answered. The call was for me. It was the speech patholo- 
gist calling to check how | was getting along. Can you 
imagine getting a call at that hour? Well, | poured out my 
fears to him and a very noticeable softness and quietness 
became evident in his voice. He told me that he had never 
heard of anyone losing the use of the second vocal cord. In 
that quiet voice that sounded as if he were suffering with 
me because of my fears, he gave me a great amount of 
strength by telling me not to worry about it—that he did- 
n’t know why I was experiencing what | was going through 
but that he felt sure that within a short time my voice 
would be back to what it had been in the weeks before this 
very frightening period. The assurance, the calming 
effect, the knowing that he was concerned about me after 
so long a time was the best medicine that | could have 
had. And do you know what? The very next morning my 
voice was back! | can’t explain it. Unless the horrible fear 
of truly losing the ability to speak caused the voice to be 
worse than it really needed to be. 

| think we must both be concerned too much with how 
we sound to others. Even yet | am very self-conscious of 
how | sound to others much of the time. | am getting bet- 
ter. Sometimes | actually forget about my voice quality, 
but most of the time | hear my voice as something less 
than what | would desire if | had a chance. But | am so 
thankful for what | have that | don’t know why I can’t do as 
you suggest—talk as | do with what | have and forget 
about what people think about it. 

My voice has been strong for the most part during the 
Christmas holidays. But today it is thick, as though there is 
a fog within me. | dreaded answering the phone a while 
ago. Frustrating. | don’t know why our voices fluctuate so. 
Frankly, | do not believe that you should quit your job. You 
see, since | teach, | have 3 months in the summer when | 
don’t teach. My voice has the same periods of fluctuation 
then as it has when | teach. | think life itself causes our 
bodies to change. For a normally healthy person without 
the voice problem that we have, these changes are not 
evident as they are in us. And in the summer when | am 
not working and under no stress and strain, | will have my 
days of bad voice. On my bad days my voice sounds thick, 
sort of gravelly, and has very little volume. When | try 
increasing the volume, some of my sounds come out as 
air instead of audible sounds. Frustrating? | can’t tell you 
how very much. And of course that just makes it all the 
worse. But why wouldn’t | be frustrated when | am lectur- 
ing to college students and my voice comes out in a stu- 
pid-sounding way. But the point is that it happens on 
weekends, in the summer, during Christmas vacation, etc. 

Does your voice carry in a large supermarket? Does your 
voice perform normally in any large area? Mine doesn’t. | 
need walls for the voice to bounce off of, it seems. My 
voice is always much better in a small room. Nearly all my 
classes are conducted in rooms that can accommodate 30 
people. When | teach in a larger room, | have to work 
harder. However, | found out just very recently that even 
when my voice seemingly has very little volume, it has 
something that causes it to “carry” much better than one 
would have reason to hope for. But in a noisy large area 
such as a supermarket, a listener has to come quite close 


to me to hear what it is I am saying. And I have a lot of trou- 
ble talking to the supermarket clerk when | am being 
checked out. When they talk to me and ask me some- 
thing, | answer, but they have problems hearing me. | feel 
very embarrassed, hurt, and frustrated at such times. 

Now, after all that complaining, | must quickly say that | 
am so thankful for having had the surgery, for having the 
ability to talk without that horrible strain that | had prior 
to surgery, for being able to function well enough to be 
able to keep my teaching job. It still is the best thing that 
ever happened to me. 


The reasons that the speech-language pathologist needs to 
be trained in psychologic interviewing and counseling are 
the following: 


+ The high incidence of psychologic causes and effects of 
voice disorders requires that these etiologic factors be 
investigated so that they can be understood and dealt 
with. 


+ Improvement or even disappearance of the voice disorder 
is common during the patient’s disclosure of interper- 
sonal problems causing the disorder. 


+ Parceling out the patient with a voice disorder to psycho- 
logic personnel to obtain basic personal examination data 
and to manage less critical emotional problems is often 
impractical. 


+ Patients who have psychogenic voice disorders may 
relate better to a psychologically oriented speech- 
language pathologist (who is not officially defined as a 
mental health specialist) during the initial phases of 
evaluation and therapy than to a psychiatrist or psychol- 
ogist, because some voice patients tend to reject such 
referrals. The speech-language pathologist can lead the 
patient into areas of psychologic sensitivity and gradu- 
ally educate the patient about the need for professional 
counseling. Such an approach has salvaged many a 
patient who otherwise would have become offended and 
who would have rejected premature referral to mental 
health personnel. 


+ Many voice disorders are caused by mild, transient inter- 
personal problems and not by deeply embedded conflicts 
or life-long psychoneuroses. These patients are not a dan- 
ger to themselves or to others. They can be safely and 
conveniently managed by the speech-language patholo- 
gist trained in psychologic interviewing and counseling. 


è In clinics in which speech-language pathologists have 
easy access to laryngologists, psychiatrists, and psycholo- 
gists, the risk of encountering trouble dealing with critical 
emotional problems of voice patients is practically elimi- 
nated. 


+ The interviewing and counseling skills learned by speech- 
language pathologists increase their own self-under- 
standing and their understanding of others. It teaches 
them how to ask questions and how to deal effectively 
with people in trouble. 
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¢ Mental health professionals often have a full patient load 
and patients have difficulty being seen immediately, 
delaying treatment and return of their voices. 


¢ Characteristics of a Successful 
Interviewer 


The most important ingredient in successful interviewing 
and counseling is the speech-language pathologist as a per- 
son. Clinicians who communicate best with people who 
have problems possess the following traits: 


¢ Their orientation to life and to others is one of equanimity 
and acceptance. Voice patients come from all cultural, 
educational, and economic backgrounds, talk differently, 
look different, and harbor beliefs and habits of unimagin- 
able varieties, many contrary to the clinician’s. If one is by 
nature hypercritical, judgmental, prejudiced, or threat- 
ened by such behaviors, although not verbally disapprov- 
ing, these attitudes will nevertheless be communicated in 
other ways. Consequently, clinicians must learn to accept 
patients’ habits, ideas, and lifestyles as belonging to them, 
separate from their own, and they must deal with 
patients within a framework of the patient’s life orienta- 
tion. 


+ They respond to personal information with understanding 
and encouragement. Clinicians must provide an atmos- 
phere in which the patients perceives that the clinician is 
very much interested and concerned about them. 


+ They know how to ask questions. The speech-language 
pathologist must learn to ask open-ended questions that 
will encourage the patient to free-associate. “Tell me 
about your family, your work, your relationships.” The cli- 
nician should interrupt only when the patient seriously 
strays off the track or skirts personal data, reciting names, 
dates, and places, thereby avoiding disclosure of personal 
information. The clinician’s objective is to elicit how the 
patient feels and not just a recitation of events. 


+ They know how to listen. Clinicians must learn to 
become attentive listeners and to refrain from giving 
advice. This rule has exceptions, but in general speech- 
language pathologists want to help patients solve their 
own problems by allowing them to rely on their own 
resources. 


+ They are perceived as sympathetic and trustworthy. As 
such, patients will find it easier to confide in clinicians. To 
witness the superficial, defensive, organ-oriented patient 
give up facades and become more human, reacting on a 
more personal level, using more personal language, is a 
satisfying accomplishment. Tears well up in their eyes. 
They will apologize for losing emotional control; the cli- 
nician reassures them that it is proper and desirable for 
them to express themselves. Full expression of emotion in 
interviewing and counseling for voice disorders is tension 
relieving. Such an abreaction has untold clinical impor- 


tance. During such release, psychogenic voice disorders, 
or organic disorders with psychogenic components, often 
disappear, convincing clinician and patient alike that the 
voice problem was wholly or in part related to unex- 
pressed conflict and emotions, a point the patient often 
resists up until such breakthrough. With emotional 
release, patients suddenly find themselves freed to dis- 
cuss situations openly that they have been suppressing. 
The speech-language pathologist is often the first and 
only person to whom they have disclosed this private 
information, and they feel a tremendous relief for having 
done so. What often begins as a “voice disorder” ends as a 
sign/symptom of a larger interpersonal conflict that 
demands a solution; the voice disorder was only the “tip 
of the iceberg.” Clinicians must recognize and accept that 
many voice disorders are unconscious substitute solu- 
tions for patients who cannot solve larger problems. The 
speech-language pathologist’s job in such cases is to help 
the patient recognize that the voice is only a surface sign 
and that they need to face and solve embedded problems 
before a normal voice can be attainable. Many voice dis- 
orders occur because of interpersonal conflicts in which 
the patient has suffered an injustice for which that person 
has not had his or her “day in court.” To have one’s “day in 
court” is to be able to express one’s need for justice, 
something these patients have been unable to do verbally. 


They are willing to set aside adequate time for the psycho- 
logic aspects of the examination and therapy. Psychologic 
interviewing and counseling take time. No matter how 
knowledgeable or skilled the clinician, if a special block of 
time is not set aside for leisurely discussion, then rarely 
will the patient open up. Rushing the examination is anti- 
thetical to good interviewing and counseling. 


The following scenario is not unusual in the life of the 


speech-language pathologist specialist in voice disorders: 


It is 3:30 in the afternoon on a Friday. One ought not to be 
distracted by thoughts of the coming weekend's diversions, 
but one is, nevertheless. This is the last patient of the day, 
and I wish for something simple. The history is as follows. 
The patient was referred to me by an otolaryngologist. She 
had been referred to him by her internist because of dys- 
phonia and throat pain. Her general health was good. Her 
main reason for coming to the clinic was back pain, neck 
pain, and hoarseness. She feared cancer. For her low back 
pain, she had seen a physiatrist, who had prescribed relax- 
ation exercises and who told her to lose weight. No etiology 
of her pain had been formulated along the way, and she 
was being treated symptomatically. 

The laryngologist found her larynx to be clean—not a 
shred of evidence of a structural lesion or paralysis, and he 
reassured her that she did not have laryngeal cancer. She 
had pain and tenderness, especially on the right side of her 
neck just beneath the angle of her mandible level with the 
hyoid bone, and the area was sensitive to pressure. The 
laryngologist injected lidocaine into the region, which 
numbed the pain temporarily, but her dysphonia 
remained. He then sent her for evaluation and treatment of 
her voice disorder. 
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She was a neatly dressed, ruddy-faced woman of 56 
from Saskatchewan where she and her husband farmed, 
apparently prosperously. This was her second marriage 
and had lasted 15 years. There were no children. She 
described the marriage as happy, and she denied life prob- 
lems associated with her voice disorder. Gradually, 3 years 
ago, she became aware of both tightness in her throat and 
episodes of hoarseness, which became chronic. The throat 
pain began during her dysphonia. 

On examination, her voice was hoarse and strained, 
resembling adductor spasmodic dysphonia, except that the 
hoarseness did not fluctuate; it was continuous. It did not 
have quite the proper spectrum for spasmodic dysphonia. 
Her entire neck musculature felt rock stiff to the touch, and 
she winced and withdrew when I touched her neck just 
below the angle of the mandible. 

Did she abuse her voice, | asked? How much, and under 
what circumstances? Not very much, she shrugged. Once in 
a while she would shout across a field to her husband, but 
she was not a talker. I wondered if she felt isolated living on 
such a large farm with few neighbors. But, she claimed she 
was content. 

The discussion fell off into silence. She gazed at the wall 
and frowned. “Come to think of it,” she said, “I have had to 
raise my voice in talking to my husband. He seems to be 
getting hard of hearing. It’s irritating. I say something and 
he asks me to repeat it; it gets tiring. No, he has never had 
his hearing tested.” We talked about the possibility that 
voice strain and general muscular tension might have been 
brought on by her having to speak in this way to her hard- 
of-hearing husband. I encouraged her to get her husband 
to see an otologist to determine the cause and treatment of 
his hearing loss. 

By now it was 4:30 pm and I was hoping for an ending to 
this evaluation. But I felt irritated with myself as I lifted half 
off the chair to extend my hand to say goodbye. The next 
words came as if from some other jurisdiction in my head. I 
dropped back into my chair. “There’s one more question I’d 
like to ask, if you'll permit me. How is your sexual relation- 
ship with your husband?” For the first time in over an hour, 
she looked me dead in the eyes. There was an excruciatingly 
long period of silence. Then she turned her head so that I 
could not see her face. She turned back, eyes closed, head 
down. A tear rolled down each cheek and cascaded onto her 
lap. She was choked up and could not speak but answered 
with a negative shake of her head. When she regained her 
composure, she explained that during the past few years her 
husband's sexual interest had declined to nearly complete 
inactivity. Until then, their relationship had been good. Then 
she found that she would have to initiate any relations, and 
most of the time he would decline, saying that he was too 
tired or wouldn't be able to perform. “How did you feel about 
this?” I asked. Tears again welled up in her eyes, and she said 
that she guessed she'd have to accept the situation, but had- 
n't quite anticipated such an early sexual demise. She said 
that she felt guilty for having brought up the subject because 
she loved her husband and, with that, she cried bitterly. We 
talked about her husband for a while, that he was only 58, 
still relatively young. Perhaps, she thought, his growing obe- 
sity might have something to do with his declining libido. 


By 5:15 pM, at least 50% of her hoarseness had disap- 
peared. We talked about marital counseling, and she was 
certain that their family doctor could help them find a 
counselor closer to home. We both sat in wonder at how 
the larynx and voice react to pent-up emotions and how 
the throat pain came from the suppression of crying. As she 
left, she thanked me for bringing her problem into the 
open: “Nobody else ever had,” she said. 


They are patient and persistent. We have noted the ten- 


dency for patients with voice disorders to deny emotional 
problems. Because patients skirt main issues, which they 
are either unconscious of or unwilling to divulge, the cli- 
nician must be prepared to wait and must not be afraid to 
press ahead for information when he or she feels that 
information being obtained is sterile and that he or she 
detects something beneath the surface that the patient is 
concealing. With patients who have voice disorders, the 
completely nondirective approach is not quite the right 
one. The optimum falls somewhere between the directive 
and the nondirective. 


They are authentic. Many professionals have the idea that 
professionalism is equated with formality, rigidity, and 
even coldness. They entrench themselves behind their 
desks. Their posture, the architecture of the room, and 
harsh lighting restrict verbal and emotional intimacy. The 
clinician should get out from behind the desk; sit closer to 
the patient; look the patient in the eyes. It is important to 
try to feel what the patient is feeling; one should not be 
afraid to agonize with the patient. If the clinician has had 
lifelong difficulty expressing emotions, he or she is going 
to have difficulty doing so with patients. The world is cold 
and impersonal, and people do not display their feelings 
easily. It is rare for patients to meet someone who 
behaves humanly. They are grateful for it. 


They are honest. Patients respect professionals. Some- 
times, clinicians think patients imbue them with more 
knowledge than they deserve. There is a dangerous 
temptation for the clinician to erect a facade of superior- 
ity and authority. Patients ask many questions about the 
causes of and therapy for their voice disorders. Could it 
be caused by this pollution, that postnasal drip, or an 
unfriendly magnetic field that surrounds them, some 
incident that happened years ago? At such times, there is 
a temptation to appear more knowledgeable than reality 
justifies. If clinicians do not know, they should say so. 
Patients have to learn to accept the clinician’s fallibility 
as well as their own. Nonetheless, clinicians are being 
consulted for their knowledge and, therefore, should 
communicate technical information as clearly and effi- 
ciently as possible. For this reason, thorough knowledge 
of voice disorders and related fields is imperative. Speech- 
language pathologists must be as well read as possible, 
and if they do not know the answers to certain questions, 
they should find out for themselves and for their 
patients. 


They help the patient to form his or her own conclusions. 
When it comes to helping patients realize the relationship 
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between a psychogenic voice disorder and events in their 
lives, clinicians must encourage patients to speculate 
whether there might be such a connection, and clinicians 
must not make interpretations for them. People who 
arrive at their own conclusions believe them more firmly 
than if they are told. The process is called insight. It is a 
coming together, a gestalt, a realization of the intercon- 
nection among things. 


+ Stages of Personality Growth in Patients 
with Psychogenic Voice Disorders 


A patient with voice disorders of primarily psychogenic ori- 
gin is experiencing a failure to express his or her self. Clini- 
cians must help that person learn a greater freedom of verbal 
self-expression. There is no magic to this kind of therapy. It is 
nothing more or less than giving the patient encouragement 
to achieve what he or she has wished for many years: to 
express the self at any level with a feeling of freedom. The 
pioneering originator of client-centered psychotherapy, Carl 
Rogers (1961), has analyzed the process of change from 
lesser to greater communication with the self and the out- 
side world as a result of psychotherapy. 

The process is one of change from fixedness to flowingness: 


Stage I 
+ unwillingness to communicate with the self 


+ feelings and personal meanings neither recognized 
nor owned 


+ rigid personal constructs. Communicative relation- 
ships construed as dangerous 


no problems recognized by the patient 
no desire for change 


+ considerable blockage of internal communication 


Stage II 


+ Expression begins to flow on the subject of non-self 
topics. 


+ The patient perceives problems as being external to 
the self. 


There is no sense of personal responsibility for those 
problems. 


Language indicates that patient does not “own” ideas 
expressed, and he does so in the past. 


Feelings expressed are unrecognized and unowned. 
+ Experience is bound by the structure of the past. 


Contradictions are expressed with little recognition by 
the patient that they are contradictions. 


Stage III 


+ freer flow of expression about the self as an object 


Stage IV 
+ expresses more intensive feelings 
+ greater expression of feelings in the present tense 


+ loosening of the matter with which experience is 
intertwined 


+ greater feeling of self-responsibility for problems 


+ Common Psychopathology in Patients 
with Voice Disorders 


Patients who have voice disorders are often in need of help 
because of psychopathology either related or unrelated to 
their voice disorders. Recognition and referral for psychiatric 
consultation may or may not be productive. The following 
material has been abstracted from The Psychiatric Interview 
in Clinical Practice by MacKinnon and Michels (1971), which 
is strongly recommended reading. 

Psychopathology is concerned with emotional disorders— 
neuroses and psychoses. Behavior and character disorders 
are included; they interfere with the patient’s ability to func- 
tion properly at home, at play, at work, and in affectional 
relationships. 

The distinction between neurotic and character traits is 
that neuroses consist of compromises between repressed 
wishes and unconscious fears in which the individual is try- 
ing to deal not only with the real world but also with inner 
anxieties. The neurotic is aware of something foreign in his 
or her makeup and wishes to be free of it, whether depres- 
sion, anxiety, phobias, obsessions, compulsions, or conver- 
sion reactions. Character traits, on the other hand, are more 
basic to the individual’s personality, and the individual feels 
relatively comfortable with them. Nevertheless, such people 
may be irresponsible, impulsive, aggressive, and mistrustful. 
The following are common clinical syndromes found in 
patients with voice disorders. 


The Obsessive Patient 


The main character traits of the obsessive personality are 
punctuality, conscientiousness, tidiness, orderliness, and 
reliability, stemming from a fear of violating the rules of 
social conduct. Parsimony with time and money is character- 
istic, as is exaggerated concern about proper social codes and 
competitiveness. Intellect is used to avoid confrontation 
with the emotions; the obsessive is a cerebral individual. 
Vocabulary and theoretical constructs are highly developed, 
and the language used in communicating with others is 
often confusing rather than clarifying. An overreliance on 
detail often masks the major point of the commentary. 

The interview is characterized by a tendency for the obses- 
sive to control the interviewer and the interviewing situation 
to avoid the expression of deeper feelings. Definitions, termi- 
nology, and other intellectual aspects of behavior are chosen 
by the patient for discussion rather than underlying feelings 
and emotions. Sometimes the obsessive overcontrols the 
interview to such an extent that the roles of interviewer and 
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interviewee appear reversed. To avoid disclosure of underly- 
ing feelings, the patient may show characteristic speech pat- 
terns, particularly in the voice. Loudness may be low or 
speech articulated so poorly that the interviewer will not be 
able to hear what the patient is saying. Silence often occurs; 
it is another technique used to avoid emotional contact. 


The Hysterical Patient 


The hysterical personality is warm, imaginative, charming, 
and exuberant. Such patients are often attractive physically, 
as well as in personality. Verbal communication is character- 
ized by the expression of inner feelings of emotion in relat- 
ing experiences. Exaggeration is a common quality of their 
language. Overdramatization is at the core of their commu- 
nications. 

The reason that these traits are considered pathologic is 
that they are superficial, and the patient is often devoid of 
any real feeling of affection, love, or intimacy. Although in 
outward appearance the patient may seem at ease and con- 
fident, beneath may lie anxiety and insecurity. Female hys- 
terics are often seductive and are possibly a trap for the male 
clinician, but the purpose of this behavior is to obtain 
approval and admiration rather than sexual intimacy. Help- 
lessness and dependency may pervade the clinical relation- 
ship, with the patient attempting to maintain a childlike 
relationship with the clinician. 

In contrast with the obsessive patient, the hysteric lacks 
orderliness and punctuality. Ordinary daily tasks are boring 
and avoided. Thinking is impulsive and intuitive. Commonly, 
there are marital or sexual problems because of the fear of 
emotional expression. Apropos to voice disorders, patients 
with hysterical personalities frequently develop conversion 
voice disorders as well as other somatic complaints. 

Although interviewing hysterical patients is often person- 
ally and socially pleasant, such patients can produce sterile 
interview results because of their inability to discuss deeper 
feelings. Overdramatization and role-playing become irritat- 
ing to the clinician attempting to get beneath the surface. 
The language of the hysteric impedes communication of the 
true self, as does the obsessive’s overintellectualization. In 
the hysteric, the language is different, however; it is one of 
overdramatization, exaggeration, and distortion. The clini- 
cian gets the uneasy feeling of a lack of congruence between 
the patient’s language and what the patient is actually feel- 
ing or the facts the patient is disclosing about his or her 
background and experience. 


The Depressive Patient 


Sadness, helplessness, and reduced self-esteem are mild 
signs of depression with which all humans must contend. 
Clinically depressed patients lose interest in life, however, 
and have diminished appetite and little enthusiasm for the 
general pleasures of life. Such people are preoccupied with 
themselves, ruminating about the past. Anxiety can be part 
of the syndrome, and so can anger. Overall, the depressed 
patient is slow of thought and movement, is preoccupied 
with bodily complaints and health, has difficulty in sleeping, 
awakens early, is fatigued, and often has changes in appetite. 


Although many depressions stem from lifelong family path- 
ologic conditions, others are considered constitutional or 
biologic. Many individuals become depressed over the loss of 
friends or relatives through death or separation. Paradoxi- 
cally, certain people become depressed after promotion or 
other forms of recognition that threaten to expose incompe- 
tence, either real or imagined, or because success is equated 
with hostile aggression. 

Patients with voice disorders are usually depressed. It is 
either a cause of the voice disorder or a result of the voice 
disorder’s interfering with their enjoyment of life. If the 
depression is the effect of the voice disorder, voice improve- 
ment often resolves the depression. Otherwise, psychiatric 
consultation and therapy are necessary. Often, patients with 
voice disorders are depressed because they are under the 
misconception that they have “caused” the disorder. Their 
depression is often alleviated when they are reassured that 
such is not the case; clinicians have an important educa- 
tional function in this regard. In addition, the hope that can 
be generated from voice therapy, as well as the encourage- 
ment of the speech pathologist, can contribute considerably 
to the alleviation of depression. Information and reassurance 
go a long way in helping the patient with a voice disorder to 
attain a happier frame of mind. 

Which of the above mentioned personality types do you 
recognize in the following histories? 


Case Study 9.2 


A 30-year-old mother of three children has been under 
stress from several sources: She works at a supermarket 
full-time to supplement the family income while having to 
maintain her household. She is dominated by her mother, 
who is hypercritical of how the patient manages her life. 
The patient and her husband disagree on many matters, 
and communication between the two is very poor 
because both have emotional problems. She has had sev- 
eral previous episodes of aphonia and dysphonia, but on 
interview her voice is nearly normal. Laryngologic exami- 
nation was normal. 

Clinician (C): Your voice sounds fairly normal to me. Is 
this the problem that you came to the clinic for? 

Patient (P): Well, I’ve had trouble off and on for about 
the last 2 years. 

C: Do you actually lose your voice completely so that 
you have to whisper? 

P: Back on the 18th of September, | could hardly get it 
up at all. | felt as if | was dragging everything right on out. 
But usually it is mostly a whisper. 

C: What is the longest period of time that you have lost 
it? 

P: Well, |wouldn’t say that I’ve completely lost it, except 
for this last time on the 18th of September. But it’s always 
been hoarse, almost gone. 

C: How longa period of time had you lost it starting with 
the 18th of September? 

P: Well, it sounded like | had laryngitis for about a week 
before. And then it just left. 


C: Did you go around whispering then? 

P: Well (laughs), | could hardly do even that. | mean, | 
felt like | was pulling everything right on out. My husband 
called the doctor and he gave me some medicine. It took 
~24 hours before it took hold. 

C: And did you lose it again? 

P: It got hoarse, but | never lost it again. 

C: Then why did you come here? 

P: My doctor sent me here to see this guy (laryngolo- 
gist) that looked down my throat, and he referred me to 
you. 

C: Did anything happen on or about September 18th 
that might have upset you? 

P: | wouldn’t say it’s any one thing. | can go back several 
years. | work, and | was hired to work 3 days a week. Well it 
so happens that | think the first time it happened was when 
| got left alone—we usually have three girls working—and | 
handled the whole burden plus three children at home, 
housework. Ever since then | have been carrying more of a 
load. They’ve been quitting, coming and going, and now 
we've been one gal short since the last of March, and this 
hoarseness has been pretty steady since then. So | think 
it’s an emotional thing. 

C: Why do you say it’s emotional? 

P: Well, | think it’s my nerves because I’m trying to do 
too much. You know, putting in 5 days a week, 8 and a half 
hours a day, plus taking care of a home, children. 

C: As you tell this to me, you seem to be on the verge of 
crying. (Mention of this brings tears to the patient’s eyes.) 

P: Nerves. (Smiling, holding back tears.) 

C: How about people who are close to you at home and 
at work? Are you having any problems with any one indi- 
vidual? 

P: Hmmm. Not now, not since this one gal quit. She 
wanted a job. She had to have a job to get out of the house 
because she was tending to be an alcoholic, but she didn’t 
really want to work and | was more-or-less carrying her 
load and my load both. Well, she was getting on my 
nerves. Other than that, | don’t know. My kids get on my 
nerves. 

C: How about your husband? (Note at this point her 
unawareness or her unwillingness to divulge the problem 
of her husband as a major source of her unhappiness and 
how it comes to the surface later in the interview.) 

P: Very seldom. 

C: How about your relatives? 

P: (Laughs) My mother. 

C: What about your mother? 

P: (Patient gives a long sigh.) Well, oh, she keeps trying 
to run our lives, | guess telling me what to do and what | 
shouldn’t do, sometimes running me down. 

C: How does she try to run your lives? 

P: Well, | don’t know. Since we’ve been first married, 
well, | don’t know, she says my husband hasn’t got a good 
enough job, or | don’t call her often enough. She has never 
approved of the doctor we go to, and (long sigh) she’s 
started in on the dentist | go to, and | don’t know, | never 
keep my house clean enough. She seems to think | am not 
strict enough with the kids, training them, you know, to 
say “thank you” or “please.” | don’t know, it’s a lot of 
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things in general (close to tears again). | guess | let too 
many people bother me, I think. 

C: What else? 

P: Gee, | don’t know. | don’t know if there is anything 
else. My husband and | are perfectly happy, we don’t have 
very many arguments. | know we don’t see eye-to-eye on 
a lot of things. 

C: Like what? 

P: But | try not to contradict him, | let him have his own 
way. 

C: What kinds of things don’t you see eye-to-eye on? 

P: Politics, a few things at church (laughs, long pause). 
We-l-l-| (word prolonged), we’ve been rivals through high 
school (becoming mildly indignant). Now, this is a silly 
thing. Every year at this time (becomes more tearful, 
voice beginning to quaver), he runs down my school, talks 
up his, every time sports start—especially basketball 
(laughs). 

C: Does he do it good-naturedly, or does he do it mali- 
ciously? 

P: Well, lm always wrong. No matter what | say, he runs 
it down. 

C: Everything you say? 

P: Other things, too. 

C: So, there is some problem between you and your 
husband. 

P: Hmmmmm. | suppose you could say that. Let him 
have his way. Don’t argue with him. What’s there to do? 
Instead of causing an argument, | keep my mouth shut. 

C: Has this been going on since you were married? Or 
has it been more recently? 

P: Well, | don’t know. When he used to listen to his ball 
games, | used to walk out of the house. He used to get on 
my nerves. 

C: Why? 

P: | don’t know. He sits there telling me all of the faults 
of what they are doing or what they should have done, 
and it doesn’t interest me. And if | tell him | don’t want to 
hear about it, well, then he gets mad. | mean, if there are 
times when | want to talk to him, ifthe sports news is on or 
a political thing or something (closer to tears) he wants to 
watch, lve got to keep my mouth shut, but if there is 
something | want on and he wants to talk, I’ve got to sit 
there and listen or he gets mad. 

C: Is he a short-tempered person? 

P: | am. | don’t know, he is calmer (crying now). But it 
seems like he’s got to have more of his way. Not really his 
way, | don’t know just how to put it. If he wants help with 
something, you just have to jump to it right away or else 
he gets mad. When | do things, I just go ahead and do it. | 
don’t stop and ask whether | should do it this way or do it 
that way or do it all. But, with him he’s got to discuss every 
little detail. If he should do it or how it should be done. So 
often he’s got to have help with everything. 

C: Is he a perfectionistic person? It sounds as if he wants 
things to be done just so. 

P: lm more that way. | don’t know, maybe it comes 
down to money problems. We don’t really have money 
problems. But to him it’s a mighty big thing. At times | just 
like to go out and have a good time, go to a dance. But he 
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doesn’t dance. If | want to take the girls to a show, | do it 
myself. He’d rather sit home and watch those damn ball 
games. (Cries bitterly.) It seems that I can’t say anything. 
| can’t express my point of view. Because I’m never right. 
| mean, when it comes down to politics, well, he’ll say, 
“Vote this way. There is no sense in voting against me. 
That will only cancel both our votes.” | just don’t know 
what it is. Like, coming here today, he says the only thing 
my doctor sent me over for was to see whether there was 
something wrong with my throat. He says, “How many 
times are you going to come back here, run up a big 
bill?” He says, “This is it, this is the last time. You can’t be 
chasing back and forth.” There he’s telling me again 
(extremely angry) and, as far as I’m concerned, | want to 
get down to the bottom of it, to find out why I’m losing 
my voice. (Breaks down again.) And | don’t care how 
much money it costs! (Long pause while patient com- 
poses herself.) 


C: How long have you been married? 

P: Oh, alittle over 11 years. 

C: How do you basically feel about your husband? 

P: Oh, I love him. He is a good man. | don’t know, maybe 


the thing that bothers me the most is he’s too much of a 
perfectionist on how the house should be run. | know his 
own mother told me she was glad to get rid of him, 
because she could sweep the floor and he’d go right after 
her and sweep it again. This thing about having the house 
being just so, so often he’ll refer to back to being in the 
Army. | have to hang the shirts on the hangers all one way, 
all facing one way. The roll of toilet paper even has to be 
on one way. Even with the girls, if they don’t make their 
beds just right, he threatens them. He says that he’ll come 
up there and rip them all apart and start’em all over. He 
says, “That’s what they do to you in the Army.” So, | don’t 
know if it’s got to do with that or what. (Long pause.) But, 
| didn’t think it was any of this that was causing my trou- 
ble, | figured | was just working too many hours trying to 
be a good guy, trying to fill in until they got somebody 
hired but I just keep getting further and further behind in 
my housework, and this bothers me, too. 


Patients with psychogenic voice disorders are referred to 
speech-language pathologists who practice in hospitals as 


well 


as in clinics. 


Case Study 9.3 


The following experience concerns a young woman from 
a South American country who came to the United States 
for surgery unrelated to the larynx. She had had endotra- 
cheal anesthesia and awoke with a severely breathy dys- 
phonia. The laryngologist wrote: 


“She gets a good glottic closure and has a normal voice 
at times. Should be a functional [sic] dysphonia. Get 
speech pathology to see her.” 

The door to the hospital examination room opened, 
and anurse escorted the patient into the room and then 
disappeared. 


The patient collapsed into the chair opposite mine 
and then jackknifed forward, her head falling into my 
lap. She burst into great sobs, through which she man- 
aged to blurt out, “Oh, doctor, doctor, my family, they 
are driving me crazy!” Her sobbing subsided, replaced 
by great sniffling noises. Then she was quiet and finally 
sat up in her chair. | withdrew a clean handkerchief into 
which she noisily blew her nose, then smiled. Her eyes, 
although wet and red, were large and dark, and her skin 
had a reddish hue. Her lips were full, red, and etched in 
a cupid’s bow. When she smiled, her teeth shone in 
white contrast against her dark skin. She spoke again, 
“I’m sorry, doctor, for having troubled you like this.” Her 
English was excellent, delicately graced with a Por- 
tuguese accent. Her voice was severely breathy border- 
ing on aphonia, but it retained the faint outlines of pitch 
and inflectional changes, which gave her voice expres- 
sion. Her cough was sharp, giving evidence of her fun- 
damentally sound larynx. What did she mean, | asked, 
that her family was driving her crazy. Angrily, she told of 
how, just before | had arrived, she had a telephone call 
from her mother, who had reprimanded her for not 
telling her she was going to leave the country for sur- 
gery. 

Born in Brazil, the daughter of a coffee plantation 
owner, the patient had been educated in a Catholic 
girls’ school in the United States where she had met her 
husband, aged 38 years, 6 years older, who was also a 
student in the United States studying chemistry. She 
withdrew from her wallet a color photograph of her hus- 
band perched high upon a chestnut brown horse. He 
wore a red beret and an ascot and was, she said, an 
excellent polo player. She also showed me photographs 
of her two sons, 10 and 12 years old, both blond and 
handsome. 

Then she lowered her head, and when she looked up 
again, tears were in her eyes. Her jaw was clenched, and 
her eyes were wide, and, in what seemed like mock 
anger, she rasped, “But, he is like so many Latin men. He 
makes me so angry.” Then she stopped and smiled at 
me through her tears, her head cocked to one side, 
waiting for me to respond. “What do you mean?” “He 
treats me like a child, and he doesn’t pay enough atten- 
tion to me. When he comes home at night, he sits and 
reads the newspaper or a book and doesn’t speak a 
word.” Her lower lip bulged and set. “What else?” | 
asked. She paused. Her eyelids fluttered three times. 
She smiled sheepishly. “You'll think I’m silly.” “No | 
won't,” l insisted. Along pause. “Well, | don’t like him to 
look at other women. You are surprised?” She smiled, 
and proudly flicked her head obliquely upward. “You 
don’t understand Latin men. They all think they are 
great lovers and spend every minute trying to prove it. 
We were at this big banquet and he was sitting next to 
me and across from us was a girlfriend of mine, and he 
kept staring at her decolletage all evening!” We both 
laughed. 

Then we talked more seriously, and she told me she 
had, a year ago, lost a child at birth. Then 2 weeks 
before her surgery, she received a phone call from her 
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cousin that she too had lost her baby during childbirth. | 
tried symptomatic voice therapy but was unsuccessful. | 
saw her the day before she left the hospital, and except 
for her voice, she looked and felt well. | apologized for 
being unable to help her toward a more normal voice, 
but she reassured me that my visits, she called them 
“moral support,” were more helpful than | could realize. 

Three months later she wrote: 

The first time | had a real voice was in New York, 3 
days before coming home. | had a dream that night. | 
dreamed that | was sitting down in a drawing room with 
my cousin and his wife, the ones I told you had lost their 
second baby. | was doing all the talking, and | could lis- 
ten to my own voice. | was cheerful and happy, telling 
them all about my visit to the United States. Suddenly | 
told them: “How funny. Do you know what happened to 
me when | heard of your baby’s death? | lost my voice.” 
As soon as | said this | woke up and knew I could speak 
again. | screamed, and my own voice came out! You 
cannot imagine how happy and delighted | was. | could 
not believe it. It was 7 o’clock in the morning. Since 
then, my voice began to come back for moments: 15 
minutes, 10 minutes, half an hour. The day after the 
trip, which | made all by myself, | had no voice whatso- 
ever. | felt depressed, sad, and very ill. | arrived at my 
home after a long flight and was excited to see my hus- 
band, children, parents; happy to be back home and to 
think that all | had gone through was over. After my 
arrival, my voice improved gradually, over a period of 
days. It has been 3 months since | had my operation, 
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and sometimes when | get upset | notice a slight sore 
throat and | feel that my voice begins to change. But 
that is all. Luckily | feel very well; strong, happy, and so 
mentally different from what | felt last year. 


¢ Summary 


+ Knowledge of the principles and possession of the skills 
of psychologic interviewing and counseling are indispen- 
sable in the diagnosis of and therapy for voice disorders. 


¢ Virtually all patients with voice disorders, whether organic 
or psychogenic, have some degree of emotional disequilib- 
rium, either as a cause or as an effect of the voice disorder. 


è Many nonorganic voice disorders diminish or disappear 
during discussion of life stresses responsible for those 
disorders. 


¢ Sensitivity to more serious emotional illness in patients 
who have voice disorders can facilitate referral to proper 
mental health specialists. 


+ The speech pathologist who is most likely to succeed in 
providing psychologic support of the patient with a voice 
disorder is one who has a high degree of acceptance of 
self and others, is understanding, is skillful in asking 
questions, knows how to listen, is sympathetic and trust- 
worthy, and is persistent. 
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Chapter 10 


Studies in Clinical Diagnosis 


The first principle is that you must not fool yourself—and 
you are the easiest person to fool. 


—Richard Feynman, physicist 


The purpose of this chapter is to help the student develop a 
comprehensive grasp of the clinician-patient relationship in 
voice diagnosis and treatment. The clinician needs to know 
much more than “techniques” to successfully treat patients 
who have voice disorders. An understanding of the overall 
picture is indispensable because, although the range of voice 
disorder types in the human population is limited, no two 
individuals present their voice disorders in the same manner 
owing to the following variables: age and gender; duration, 
etiology, and severity of the voice disorder; concurrent phys- 
ical illnesses; personality; premorbid and concurrent psychi- 
atric predisposition; nationality and cultural background; 
education; socioeconomic level; occupation and social situa- 
tion; motivation; verbal ability; genetic heritage; and insight. 

Inherent, and necessary, in the traditional textbook format 
is that material be presented in a logical, orderly manner. 
Textbook knowledge is one thing; clinical thinking and 
action are another. In practice, the clinician becomes a detec- 
tive and is required to work backward from fragments of 
data: in this case, the victim is the abnormal voice, and the 
clinician-detective must use surface auditory-perceptual 
and visual-perceptual signs as means of reasoning toward 
probable abnormal laryngeal physiology and, even further 
remotely, to the underlying etiology behind that physiology. 


+ Voice Clusters 


Recognizing this practical necessity, the voice cluster con- 
cept was devised to help us organize our thinking about 
abnormal voice unknowns. The concept is based on the fact 
that abnormal voices have a tendency to group themselves 
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into perceptually distinctive colonies. A voice cluster is an 
abstraction, or prototype—it is no single patient’s voice. For 
example, one cluster is the husky breathy whispered- 
continuous group under which all breathy voices are gath- 
ered. The common anatomic denominator is that the vocal 
folds do not meet firmly to form a tight seal. The specific 
anatomic, physiologic, and etiologic reasons can be very differ- 
ent from patient to patient, however, and finding out the rea- 
sons will require further investigation. The point to be made is 
that we have to begin somewhere to categorize the sound of 
the voice in order to further our logical thinking about its 
cause. Following are eight distinctive voice clusters. After list- 
ing them, each will be discussed in detail. The clusters are: 


+ the husky breathy whispered-continuous group 
+ the strained hoarse-continuous group 


+ the strained hoarse voice arrest-intermittent arrhythmic 
group 

the strained hoarse voice arrest-intermittent rhythmic 
group 


the voice tremor group 
the breathy whispered-intermittent group 


the low-pitched hoarse group 


¢- ¢ è% o 


the high-pitched group 


Clinical case studies for the purpose of enabling the stu- 
dent to apply the cluster concept will be presented. These 
studies have been selected as much for their unusual as for 
their ordinary occurrence in clinical practice. A prevalent 
misconception abounds that the atypical case is not worthy 
of attention for the very reason of its rarity; specifically, that 
time should be spent on the more commonly occurring voice 
disorders. Experienced clinicians, particularly those who have 
medical differential diagnostic experience, will testify as to 
the danger of that idea. The reason is that the least frequently 
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seen voice problem almost always provides the clinician with 
the greatest chance of making a mistake. It is important to 
emphasize that the case studies selected here are not 
intended to be all-inclusive or systematic accounts of every 
kind of voice sign or disorder that could occur in clinical prac- 
tice. Rather, the objective is to give the student a sense of the 
often convoluted nature of the diagnostic and decision-mak- 
ing process. While reading these studies, the student should 
keep in mind that matters are not always as they appear to be 
and clinicians should look for alternative, and sometimes sur- 
prising, explanations to what might have begun as a simple, 
straightforward diagnostic unknown. These studies can be 
somewhat confusing and frustrating, and this is intentional, 
to reflect actual practice. At selected points during each pres- 
entation, questions are asked that will encourage the student 
to consider alternatives in problem solving. After each case 
study is a discussion that contains the answers to most of the 
questions raised during the narrative. They can be used as 
bases for class discussion. Case studies and answers also can 
be found in other chapters of the book. 


Husky Breathy Whispered Continuous Group (HBW-C) 


When we hear a voice that deviates from normal, the first 
question asked should be “What might be happening at the 
vocal fold level to produce this kind of voice?” When we hear a 
voice that is continuously or steadily husky, breathy, or whis- 
pered, the answer is “incomplete glottic closure” (Fig. 10.1). 

“What are the possible reasons for such incomplete 
approximation of the vocal folds?” One major and poten- 
tially lethal cause is a mass lesion that produces a rough or 
irregular surface on one or both vocal folds preventing them 
on closure from forming a smooth seal along the vibrating 
glottal margin and causing turbulent airflow that we per- 
ceive as breathiness (Fig. 10.2). 

“What specific kinds of lesions are capable of producing 
such interference?” The most common are polyps, nodules, 
webs, and carcinomas. Glottic insufficiency from postsurgi- 
cal loss of tissue, such as after cordectomy, can produce sim- 
ilar results, and so can loss of mass from senile bowing of the 
vocal folds or a sulcus vocalis. 

The clinician must ask, “How do these various conditions 
affect the glottal margin and ability of the vocal folds to 
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Figure 10.1 Flow chart of Husky breathy whispered continuous voice 
group (HBW-C) showing answer to first question of “What might be 
happening at the vocal fold level?” 








Polyp 
Incomplete Nodule 
HBW_C Glottic —— Mass f 
| HBW-C | piso Web 
Carcinoma 


Figure 10.3 Flow chart of Husky breathy whispered continuous voice 
group (HBW-C) illustrating some of the myriad of possibilities of masses 
that could cause incomplete glottal closure. 


vibrate?” The more irregular the surface or the greater the 
opening between the vocal folds, the greater will be 
the breathy component. The larger the mass of the lesion, 
the lower the pitch, unless the lesion is sufficiently stiff that 
it inhibits and limits vibration to a smaller area of the vocal 
folds. The more heterogeneous the vibrating surface of the 
two vocal folds, the more irregular the vibration and hence 
the harsher the vocal quality. Thus, in the case of vocal nod- 
ules, cysts, or polyps, most of the surface is smooth and the 
vocal changes less dramatic than in the case of a voice emit- 
ted from unilateral irregular-surfaced glottic carcinoma. 
Similarly, a large glottic gap with total absence of closure is 
going to be significantly more breathy and weak, often 
nearly atonal, compared with a glottal gap but near closure. 
Thus, there are a myriad of possibilities, and clinicians are 
not trying to make a diagnosis from their perceptual judg- 
ments but merely to begin making hypotheses to help them 
ultimately discover the etiology of the problem and need for 
additional diagnostic tests (Fig. 10.3). 

Whether any of these laryngeal pathologies is responsible 
for the breathy type of voice found within this cluster will be 
determined by endoscopic examination. 

Assume that such examination has taken place and has 
failed to identify any of the lesions just described. The next 
question should be, “What other causes could produce a 
voice that falls within this cluster?” The answer is neurologic 
disease, which is the next major cause within the husky 
breathy whispered (HBW)-cluster group (Fig. 10.4). 

“What is the reason that a neurologic lesion could produce 
a voice of this cluster description? Where could such lesions 
in the nervous system take place to produce such a voice? 
With which neurologic syndromes could such voice be asso- 
ciated?” 

Neurologic lesions can prevent adduction of the vocal 
folds. Any interference with functioning of the tenth cranial 
nerve will produce unilateral or bilateral vocal fold paralysis 
or paresis preventing their approximation (Fig. 10.5). 

Careful laryngologic examination will probably establish 
whether or not such adductor weakness has taken place. 

Assume that such examination has failed to disclose paral- 
ysis. We must then entertain another possibility, that of 
myoneural junction disease, or myasthenia gravis, having a 
prominent laryngeal representation (Fig. 10.6). 
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Figure 10.2 Flow chart of Husky breathy whispered continuous voice 
group (HBW-C) depicting second step in detecting possible causes of 
incomplete approximation of the vocal folds. 
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Figure 10.4 Illustration of an additional cause of Husky breathy whis- 
pered continuous voice group (HBW-C) that should be considered 
when by endoscopic exam the vocal folds appear normal. 
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Figure 10.5 Diagram of flow chart showing one probably cause of 
neurologic Husky breathy whispered continuous voice group (HBW-C) 
is either unilateral or bilateral paralysis of the X cranial nerve. 


Further neurologic examination, including motor speech 
examination, would be necessary to confirm or rule out this 
disease. 

A third possibility is early parkinsonism, because breathy 
voice is one of its first indications. The location of the lesion 
is within the basal ganglia (Fig. 10.7). 

Further possibilities are apraxia of phonation producing 
loss of recall for phonatory movements and frontal lobe dis- 
ease producing defects of personality, motivation, and 
insight (Fig. 10.8). 

The importance of a complete neurologic examination to 
clarify the differential diagnosis among these various possi- 
bilities is obvious. It should also be apparent that these dys- 
phonias are classifiable under such larger categories as 
dysarthria, apraxia, and dementia. 

The sequence of clinical diagnostic screening from organic 
to psychogenic causes follows a practical rationale based on 
three important premises: (1) It is mentally difficult to think 
about organic and nonorganic causes together without 
becoming disorganized and even confused. Thus, taking 
these two major categories one at a time will keep us from 
becoming entangled in a morass of conflicting clinical signs 
and symptoms. (2) The reason that organic causes are exam- 
ined first is because they are potentially life-threatening. 
(3) It is easier to communicate with patients and inform 
them of nonorganic causes once we are convinced no such 
organic causes are present, because all patients who have 
discomfort and/or sensory or motor changes believe their 
dysfunction is organic. 

At this point in our schema, let us say we have satisfied 
ourselves that the patient who presents with a voice of this 
cluster does not have any of the mass lesions or neurologic 
diseases suspected. “Where do we go from here in our 
thoughts and procedures to help explain this type of abnor- 
mal voice?” We must seriously consider that the abnormal 
voice is not caused by organic disease and begin to look for 
psychogenic, behavioral, or other medical causes such as 
reflux (Fig. 10.9). 

In order to find out if psychogenic causes exist, it is 
mandatory that a serious, probing psychosocial history be 
administered. In the broadest general terms, we are looking 
for evidence of acute situational conflicts or chronic life 
stress, time-related to onset and development of the abnor- 
mal voice. Depression, anxiety, and conversion reaction are 
the most common effects of such stresses. They produce 
laryngeal musculoskeletal tension, laryngeal elevation and 
muscle tension pain, and globus (Fig. 10.10). 

To summarize, the majority of voices that fall within the 
cluster under discussion can be analyzed according to the 
composite schema that follow in Fig. 10.5-Fig. 10.14 and Fig. 
10.17-Fig. 10.20. 
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Figure 10.6 Illustration that Myasthenia Gravis is an additional possi- 
ble etiology of neurologic based Husky Breathy Whispered Continuous 
Voice Group (HBW-C). 
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Figure 10.7 Flow chart depicting the additional neurologic causal 
possibility of Parkinsonism in the Husky Breathy Whispered Continuous 
Voice Group (HBW-C). 
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Figure 10.8 Flow chart showing common etiologies of neurologic 
Husky breathy whispered continuous voice group (HBW-C) that clini- 
cians need to consider in completing voice assessment. 
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Figure 10.9 Flow chart adding psychogenic as a possible reason for 
incomplete approximation of the vocal folds reminding clinicians there 
are numerous possible causes of Husky Breathy Whispered Continuous 
Voice Group (HBW-C). 
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Figure 10.10 Schema of some possible psychogenic causes of Husky 
Breathy Whispered Continuous Voice Group (HBW-C). 
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Figure 10.11 Composite schema illus- 
trating decision making branching in 
determining possible causes of Husky 
Breathy Whispered Continuous Voice 
Group (HBW-C). 
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It is important to note that while dividing the problems 
into organic and nonorganic provides a functional schema 
that aids clinicians in systematic decision making, most 
patients do not fall into a purely organic or nonorganic cate- 
gory. Clinicians need to be mindful that chronic stress may 
result in esophageal reflux, muscle tension, and associated 
lesions. Neurologic disorders and associated dysphonias 
may result in secondary depression and anxiety. Part of the 
clinical detective work is differentiating the primary and 
secondary relationships in the complex problem of vocal 
dysphonias (Fig. 10.11). 

The following specific case studies illustrate in detail the 
evolution of voice disorders requiring the kinds of procedures 
and reasoning heretofore discussed. After this first series of 
case presentations, schematic flow diagrams will be pre- 
sented for the remainder of the cluster groups, although 
without detailed commentary as in this one, allowing the 
student to work out the meaning of the charts independently. 


Case Study 10.1 


A 40-year-old, female, unemployed high school science 
teacher married with three children had a normal voice 
until 2 years prior to examination when she underwent 
thyroid surgery. Upon recovering from the anesthesia, 
she was alarmed to discover that her voice was almost a 
whisper, and she would choke on liquids unless she swal- 
lowed small amounts at a time. 


Q1: What do you suspect is the etiology of her dysphonia? 


Her nurse had written that her cough sounded “mushy 
and weak.” On laryngoscopic examination, the surgeon 
found a right unilateral vocal fold paralysis. He told her 
that it would be wise if she used her voice very carefully, 
whereupon he prescribed voice rest for 1 week. When she 
resumed talking, she found that her breathiness was as 
bad as or worse than before. She received no information 


Psychogenic 
Aphonia 


concerning the prognosis of her voice. The surgeon 
referred her for voice therapy. She was treated twice a 
week for 2 months and showed “considerable improve- 
ment,” but she retained a degree of breathiness that left 
her dissatisfied. She returned to her otolaryngologist, 
who recommended Teflon injection, but the patient, wary 
of further surgery, declined. 


Q2: What is the rationale for recommending Teflon 
(Teflon® Dupont Company, Wilmington, DE), fat, 
Gelfoam (Gelfoam® Pfizer Inc., New York, NY), or col- 
lagen injections? 


The psychosocial history is normal. She describes her 
relationships with her husband and children as happy, and 
she expresses much optimism about her life, but she does 
become upset by her voice, particularly under noisy con- 
ditions. She wants to return to classroom teaching and 
knows she will not be able to do so as long as her voice is 
abnormal. She is dismayed that her poor voice deprives 
her of pleasurable social conversation. She cries fre- 
quently during the examination. 

Her voice quality is moderately breathy, loudness level 
is reduced, and mild friction noise is heard during inhala- 
tion. She sustains the vowel /a/ with a remarkably low vol- 
ume. You are startled by her ability to triple her volume on 
demand, during which she produces nearly normal voice. 
Her cough and glottal coup are normal, and she no longer 
has difficulty swallowing. A repeat laryngoscopic exami- 
nation shows that the right vocal fold is still paralyzed. 


Q3: Explain, anatomically, why thyroid surgery should 
have produced a unilateral vocal fold paralysis in this 
patient. 


Q4: Drawasuperior view of the patient’s vocal folds during 
phonation immediately after surgery. 

Q5: How does your drawing explain the patient’s voice 
signs and symptoms immediately after surgery? 
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Q6: Why was she having trouble swallowing liquids? 


Q7: What is the explanation for the nurse’s note that, 
immediately after surgery, her cough sounded “mushy 
and weak”? 


Q8: How might you explain the fact that, on your examina- 
tion, even though the right vocal fold was still para- 
lyzed, her cough and glottal coup were normally 
sharp? 


Q9: How might you explain the fact that when the patient 
was asked to sustain vowels loudly, her volume tripled 
and her voice quality became nearly normal despite 
the persistence of her vocal fold paralysis? 


Q10: What information in the history gives a clue to the eti- 
ology of her persistent dysphonia? 


Q11: Is there an alternative hypothesis as to the etiology of 
her persistent dysphonia? 


Q12: From the data given, how would you proceed with 
therapy? 


Discussion 


This patient’s voice disorder teaches an important lesson 
about the psychologic repercussions of organic voice dis- 
orders. Here is a woman who obviously had a unilateral 
vocal fold paralysis after thyroidectomy, a not uncommon 
occurrence owing to the proximity of the thyroid gland 
and the recurrent laryngeal nerve. She was frightened 
when she awoke from the anesthesia to discover that, in 
effect, she could not talk normally and had problems swal- 
lowing. The emotional trauma of sudden voice deprivation 
in a basically healthy woman who has the responsibilities 
of a family and a career, both of which place a high demand 
on the ability to speak, may not be as well appreciated as it 
ought to be. Such anxiety is further heightened by the fact 
that no one could give her a satisfactory answer to her 
question as to when, or even whether, her voice would 
return to normal. The surgeon then places her on voice 
rest, which communicates to her that not using her voice is 
good, which must mean then that to use it is bad. The 
speech clinician who subsequently sees her for therapy 
and brings about voice improvement fails to reverse the 
patient’s concept of guarded voice use, and she reaches a 
plateau in therapy that she never surmounts. 

The clue to her potential for better voice should have 
been the discovery that the patient’s cough was dispro- 
portionately sharp in contrast with her breathiness and 
reduced loudness. This suggests, despite the persistence 
of her unilateral vocal fold paralysis, there is good com- 
pensatory approximation of the normal vocal fold against 
the paralyzed one. This was subsequently confirmed by 
laryngoscopic examination. The final proof of her 
untapped capacity for a nearly normal voice was when she 
produced a loud, clear voice on demand. Her discovery 
that she could do this provoked in her genuine surprise; 
she had not been aware of her potential. What she had 
done, quite unconsciously, was to partially shut down her 
depth of respiration and force of vocal fold adduction in a 
misguided attempt to avoid damaging her vocal folds. 
Audible sound during inhalation also suggests that the 





paralyzed fold was in the median or paramedian position 
making compensatory adduction easier. 

Therapy consisted of using behavioral modification 
techniques to increase loudness and helping the patient 
divest herself of the notion that vigorous use of her voice 
was bad. On the contrary, the more she used it, the better 
her voice would become. She was given supervised prac- 
tice using a much louder, clearer voice. She wrote 1 
month later saying that her voice was nearly normal under 
most speaking conditions. She still had a unilateral vocal 
fold paralysis but was able to compensate with her unaf- 
fected vocal fold and better respiratory support. 


Case Study 10.2 


The 47-year-old wife of an American industrialist living in 
London, England, came to the clinic because she has 
been unable to find a satisfactory explanation or help for 
the following complaint: During the past 6 months, her 
voice had been getting softer. It had reached the point 
where people had to ask her to repeat what she was say- 
ing. She was examined by two laryngologists before com- 
ing to the clinic. One said her vocal folds were normal; the 
other said that the posterior region of the folds were not 
closing completely but could make nothing of this find- 
ing. He described her voice complaint as “functional.” 

She had just been reexamined by a laryngologist and, 
for the first time, a neurologist. Neither was able to diag- 
nose her problem. The laryngologist wrote, “Her voice 
sounds pretty normal to me.” The neurologist, having just 
completed her residency, was perplexed by the case 
because the complaint was confined to the patient’s voice 
and she “didn’t have much else to go on.” She eventually 
decided that the voice complaint was “nonorganic.” The 
referral note said, “Evaluation and therapy for functional 
voice disorder.” 

The patient was a tall, exceptionally well-groomed 
woman whose manner and language revealed a keen 
intelligence and capacity for detailed recall. She reported 
that she was at a cocktail party for her husband’s execu- 
tive friends when she first noticed she could not speak 
loudly enough to make herself heard over the noise. Since 
that occasion, more than 6 months ago, she has had sev- 
eral similar episodes, and within the past 3 months she’s 
noticed it even when there was little or no extraneous 
noise. She stated that the problem was no longer just 
occasional but was “present more and more of the time.” 

The psychosocial history was completely unremark- 
able; the patient and her husband were well suited to one 
another. Their children were grown and successfully on 
their own; their daughter was an attorney for a utilities 
company, and their son was finishing a surgical residency. 
The patient admitted that when she was under the 
stresses of everyday life, her voice was noticeably worse. 

Voice evaluation revealed that her conversational loud- 
ness level was, in fact, reduced, and her voice was mildly 
breathy or husky. Her cough was so near normal that you 
are unable to consider it anything other than that or, at 
most, a mild variant of normal. One thing you note is that 


she does not use much pitch or loudness variation. When 
you point this out to her she quickly agrees, because a 
close friend had told her just the other day that, on the 
phone, she sounded “flat and depressed” and had 
expressed concern about the patient. 


Q1: What might be the diagnostic possibilities of the voice 
findings obtained up to this point? 


Q2: What is the diagnostic significance of the finding that 
the patient’s voice is worse when she is under stress? 


You ask her to sustain the vowel /a/ as long and steadily 
as she can. The first time she does this, you note that her 
voice definitely is breathy, but you think you hear some- 
thing else, a faint irregularity of some kind. On the second 
and third trials, you are certain that buried within the 
breathy quality is a rapid tremor, so subtle as to escape 
any but the most discriminating ear. 


Q3: What conclusions might you draw about the breathy 
voice quality? 

Q4: What are the possible interpretations of the rapid 
tremor? 


Q5: Why was the tremor heard only on vowel prolonga- 
tion? 


On routine motor speech examination, you find that 
her alternate motion rate and regularity on /pa/, /ta/, Ika] 
repetitions are normal, but here, again, you find yourself 
further perplexed that she does not open her lips very 
widely as she repeats the syllables. 


Q6: Comment on the possible significance of the lip move- 
ments just described. 


Q7: How might this finding relate to those accumulated 
thus far? 


Finally, because you have been taking special note of 
the patient’s facial expression during the latter part of the 
examination, you remark to yourself that it is not expres- 
sive, except when she speaks with emotional emphasis or 
smiles. Toward the end of the examination she volun- 
teers, “for whatever help it might be,” that her golf game 
has deteriorated during the same period as her voice 
complaint. 


Q8: What do you think are the possible causes of the 
patient’s voice disorder? 


Q9: What steps would you take at this point in the exami- 
nation? 


Discussion 


This study illustrates the importance of being sensitive to 
the fact that signs of neurologic syndromes in the very 
early stages of their evolution often appear to others as 
“functional,” and that we are not often blessed with their 
full, classic clinical characteristics. Here is a diagnosis that 
was missed by four physicians, primarily because the 
patient’s signs and symptoms were limited to her speech. 
This is common in early voice signs of neurologic disease; 
after all, breathiness and reduced loudness are also signs 
of psychogenicity. That is why as clinical detectives, we 
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must leave no stone unturned. We must investigate the 
physical, social, neurologic, psychologic, environmental, 
and genetic possibilities. We must look for any additional 
signs, which, when assembled, may form a pattern that 
might explain the patient’s dysphonia or other chief 
speech complaint. The first clue we get about this patient 
is that her demeanor does not fit with the typical patient 
with psychogenic or “functional” dysphonia. Her psy- 
chosocial history was unremarkable, she was happily mar- 
ried and financially secure. We then learn that, in addition 
to her breathiness, one laryngoscopic examination 
showed that the interarytenoid portions of the vocal folds 
were not approximating completely. In addition to the 
breathiness, we have evidence of a loss of pitch and loud- 
ness variations. During the speech examination, she had a 
barely detectable, rapid voice tremor on vowel prolonga- 
tion that could never have been detected during running 
speech owing to its mildness. The failure of her lips to 
make normally wide excursions on alternate motion rate 
for [pa] now became an important finding. The informa- 
tion that her voice was worse under stress proved nothing 
either way; patients with both organic and psychogenic 
voice disorders report this complaint as often do speakers 
without voice complaints. Thus, only after putting these 
findings together did her lack of facial mobility take on 
importance. 

By now it should be apparent that the suspicion of early 
parkinsonism has been raised. You telephone the neurol- 
ogist who had last examined her and ask if you might 
speak with her about your findings. You tell her that the 
patient’s voice and associated speech findings may 
be suggestive of early hypokinetic or parkinsonian 
dysarthria, which might be revealing itself primarily via 
her larynx. After expressing ill-disguised surprise, she 
thanks you and suggests that another neurologist who 
specializes in Parkinson’s disease examine the patient. He 
does. His note reads: “This patient has mild masking of 
facies, mild rigidity of her upper extremities to passive 
movement, and mild reduction in arm swing and distance 
between her steps as she walks. This, in all probability, is 
early Parkinson’s disease. The mildness of her signs does 
not warrant any treatment at this time. The patient 
should be scheduled for reexamination in 6 months.” 


Case Study 10.3 


A 24-year-old airline stewardess comes to you for advice 
concerning vocal nodules. The referring laryngologist 
describes “bilateral well-developed, probably encapsu- 
lated, nodules in the middle of the membranous section of 
the vocal folds.” The patient went to the laryngologist after 
a friend had advised her that she ought to have an exami- 
nation because someone else she knew had a similar voice 
quality and she was found to have serious organic disease. 
The patient says that her voice has been “kind of husky” 
for as long as she can remember. She was a cheerleader in 
high school and describes herself as one who talks inces- 
santly; even when she is alone, she talks to her dog. She 
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does not smoke and drinks only socially with moderation. 
On examination, her voice has a noticeably breathy-husky 


quality. 


Q1: What is the physical explanation for the patient’s 
“breathy-husky” voice quality? 


Q2: What is the most likely etiology of her vocal nodules? 


Q3: Do you feel comfortable making a therapy decision at 
this point? If so, what do you think ought to be done? If 
not, what further information do you think you need? 


The psychosocial history is as follows. After high school 
graduation and 2 years of college, she was accepted for 
flight attendant training. One year later, she married 
an accountant and continued to work at her airline job. 
She says her marriage is “quite satisfactory,” although 
recently she has been unhappy about her husband having 
to work long hours because of increased responsibilities. 
She wishes she could see more of him. She conveys the 
impression that there are no deeper problems. 

As far as she is concerned, her voice has never been a 
problem. Other than being unable to sing high notes, she 
has no voice limitations. Many people even tell her they 
like the quality of her voice. Actually, her main reason for 
coming to the clinic was that she feared a disease of her 
larynx, not because she wanted help for her voice. 


Q4: Do you think this patient is a candidate for voice ther- 
apy? Why? 


Q5: Should this patient’s vocal nodules be removed surgi- 
cally? Why? 

Q6: What would you have done if the laryngologist had 
recommended surgical removal? 


Discussion 

The “breathy-husky” voice quality in this patient is typical 
of those who have vocal nodules. The voice quality is the 
effect of the nodules projecting medially into the glottis 
from the vocal folds so that when they approximate dur- 
ing phonation, the vocal edges do not form a complete 
seal; instead, spaces are left in the region of the nodules 
allowing air to escape during phonation, hence the 
breathy quality. At high pitch, when only the edge of the 
vocal fold cover participates in vibration, the lesions may 
approximate momentarily stopping vibration, much like 
would happen if you touched a vibrating guitar string. The 
nodules may also cause a slight increase in mass that con- 
tributes to the perception of a husky voice. From the his- 
tory provided, and the laryngologist’s description of prob- 
able encapsulation, it would appear that these nodules 
have been present for a long time. Quite likely, they were 
formed when the patient was a cheerleader or before. Her 
greater-than-average voice use and working as a flight 
attendant in a dry, noisy environment have probably 
caused them to persist. 

There is serious doubt that voice therapy is necessary or 
even desirable. The patient is satisfied with her voice even 
though most clinicians would agree it was defective. In 
many vocal abusers, excessive voice use is inseparable 
from their basically extroverted personalities, making 





voice modification difficult to achieve. If she were moti- 
vated to change, teaching methods of projection without 
abuse coupled with vocal hygiene might diminish the 
abuse but would not modify the excessive amount of talk- 
ing. However, even if she were motivated, reducing vocal 
abuse probably would not significantly diminish the size 
of the nodules owing to their encapsulation. In view of the 
poor prognosis for reducing encapsulated vocal nodules 
by means of voice reeducation, there is a strong tendency 
to advocate surgical removal. Although in some instances 
this might be advisable, there are risks associated with 
surgery; there may be scarring or other tissue defects, and 
the dysphonia may remain or even become worse. More- 
over, if the surgical treatment brought about a normal 
voice, the patient might be unhappy with the outcome 
because she currently gets positive feedback from friends 
stating they like her voice. 


Case Study 10.4 


A patient is sent to you with the following history. She is a 
52-year-old woman who complains of general fatigue and 
episodes of voice loss for about 2 years. Twenty years ago, 
she had an episode of anxiety, hyperventilation, and a 
sense of impending doom. She was helped with psycho- 
logic counseling and has had only rare occurrences of 
these complaints ever since. Three years ago, she choked 
on a carrot; said it stuck in her windpipe and that she 
nearly died. Two years ago, she began to have what 
she describes as “severe fatigue, no stamina, generalized 
weakness, a need to walk slowly—like I’m 90 years old.” 
She says her symptoms are getting progressively worse. 

When she talks on the telephone for long periods, her 
“voice loses volume and becomes a whisper” accompa- 
nied by chest discomfort and a pulling sensation in her 
epigastrium. She feels more fatigued and weak during 
these spells, and three or four times she felt light-headed 
when she had them. At first these episodes occurred only 
once or twice a week, but now they occur daily “anytime | 
talk.” All of her symptoms are now worse. 

The neurologist who wrote these findings also noted 
that when he asked her to count to 100, her “voice deteri- 
orated to a whisper.” 


Q1: At this point, what causes of her voice disorder might 
you consider? 


During your examination, you discover that her voice is 
intermittently breathy during contextual speech. Her 
cough is normally sharp. She complains of pain in the 
region of her larynx. She gives no evidence of hypernasal- 
ity or nasal emission, and her articulation is normal. You 
ask her to count vigorously from 1 to 150. She becomes 
nearly aphonic by the time she reaches 60 but with no 
signs of hypernasality or articulatory disturbance accom- 
panying the voice change. Her cough during her aphonia 
retains its sharpness. 


Q2: Does this patient show the classic signs of the 
dysarthria of myasthenia gravis? 


Q3: Whatis the neuroanatomic type of dysarthria that can 
accompany myasthenia gravis and what are its classic 
signs? 

Q4: Is it possible for someone to have myasthenia gravis 
localized to the larynx only? 


Q5: Ifthe voice had deteriorated to aphonia for reasons of 
muscular weakness after the stress test of counting, 
what do you think the quality of her cough ought to 
have been? 


You discuss your findings with the neurologist, who 
suggests that Tensilon (ICN Pharmaceuticals, Inc., Costa 
Mesa, CA) be administered after voice deterioration but, 
in this case, he suggests injecting sterile water prior to the 
Tensilon test. 


Q6: Why do you think the neurologist wanted to inject ster- 
ile water before Tensilon? 


The neurologist inserts the needle of a syringe filled 
with sterile water into the patient’s arm, but she has not 
been informed that water is the substance being injected. 
Her voice, which begins as a whisper after having been 
first fatigued, becomes steadily louder and clearer as she 
counts, even before the neurologist has injected the 
water. The patient complains of “feeling funny.” The 
water is then injected with no further effect. 


Q7: What do you conclude from the events just described? 


Q8: How do you think you (the speech-language patholo- 
gist) and the neurologist should proceed with the 
patient from this point? 


Further discussion with the patient reveals, for the first 
time, that ever since she had been placed in a cardiac 
intensive care unit 2 years ago for cardiac arrhythmia, she 
has been chronically anxious about her heart. Even 
though the cardiac findings were diagnosed as benign, 
she still becomes extremely anxious whenever her heart- 
beat is irregular. 


Q9: Discuss the relevance of this latest information to the 
patient’s total neurologic and speech signs and symp- 
toms. 


Q10: What further action do you think should be taken? 


Discussion 


On the face of it, any speech deterioration during pro- 
longed or effortful speaking ought to bring to mind myas- 
thenia gravis. The question that often confronts us is 
whether a given patient with myasthenia gravis needs to 
demonstrate all of the classic signs of flaccid dysarthria; 
that is, whether all speech systems—respiratory, phona- 
tory, resonatory, and articulatory—need to be implicated 
before we can reasonably suspect this type of dysarthria. 
The answer is “no.” Although rare, bulbar myasthenia 
gravis can become manifested at the laryngeal level only, 
so it is reasonable that this patient was reasonably sus- 
pected of having the disease, especially when her voice 
had deteriorated during stressful counting. But, all find- 
ings did not fit with the idea of progressive muscular 
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weakness as the underlying cause of her bouts of dyspho- 
nia. Rarely, if ever, does the myasthenic patient’s voice 
deteriorate into a complete whisper, as this patient’s had. 
However, most telling was the preservation of her sharp 
cough during the time she was aphonic. This exhibits an 
inconsistency and is a clinical red flag telling you some- 
thing in the picture does not fit. The typical patient with 
flaccid dysphonia from true muscular weakness of the 
laryngeal adductors will produce a weak or “mushy” 
cough proportionate to the degree of dysphonia present. 
The neurologist suspected that something about his 
entire examination did not fit with organic disease 
and when presented with evidence of the sharp cough 
decided to do a placebo test—sterile water instead of 
Tensilon—first to see if the patient would respond as if she 
had been given the drug. Our suspicions were confirmed 
when the needle had been inserted into the vein and her 
voice began to improve as if she had been given Tensilon, 
even before the water had been injected. The test was dis- 
continued with confirmation of psychogenic dysphonia. 

It is curious that a small percentage of patients will 
develop nonorganic musculoskeletal reactions (which in 
some cases are conversion reactions) that simulate neu- 
rologic syndromes relatively well-known to the general 
public, such as myasthenia gravis or multiple sclerosis. 
Often, the idea is inadvertently suggested by prior med- 
ical consultations or by the patient’s experience with a rel- 
ative or friend who had the disease. 

In this patient, the reason was never found as to why 
she happened to acquire a nonorganic dysphonia that 
resembled myasthenia gravis, but the neurologist 
strongly suspected that it had something to do with her 
anxiety about her heart. His course of action from that 
point was to schedule a complete examination by a cardi- 
ologist for a new opinion, with the intent of educating the 
patient as to her true cardiac status. Held in reserve was a 
psychiatric consultation should the cardiologist’s coun- 
seling prove ineffective. 


Strained Hoarse Continuous Group (SH-C) 


Case Study 10.5 


A 47-year-old, male, community college economics 
instructor comes to you with the complaint of hoarseness 
that began developing at the onset of the fall semester. 
Now Christmastime, his voice is worse than it has ever 
been. He thinks he has been under stress because of a 
change in his teaching duties, having been given increas- 
ing responsibilities owing to the loss of a faculty member. 
He finds that about halfway through a lecture, his hoarse- 
ness increases and he feels pain in his throat. He recently 
visited an otolaryngologist who said that his vocal folds 
looked nearly normal except for some slight reddening. 
Until his voice disorder began 4 months previously, he had 
been in excellent health and had had no previous trouble 
with his voice in spite of an active teaching career spanning 
20 years. On voice evaluation, you hear a continuously 
strained hoarse voice quality. On vowel prolongation, 
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there is a “wet” sounding noise superimposed on the 
hoarseness, and, at times, it sounds as if there is a very sub- 
tle, rapid tremor or “flutter” in the patient’s voice. The psy- 
chosocial history reveals that he is happily married with 
two teenage children, both of whom are doing well in 
school. His wife is a social worker employed part-time. The 
patient admits that he has always been high-strung, that 
lecturing has always been extremely important to him, and 
that he derives most of his work satisfaction from teaching, 
finding students stimulating and challenging. This past 
semester has been more upsetting than previous years 
owing to having to teach two additional courses, which 
interferes with preparation for his lectures and reduces the 
amount of time he has to spend with individual students. 
Being compulsive and meticulous, he becomes anxious if 
he has not prepared adequately for every lecture. 


Q1: What possible etiologies of this patient’s dysphonia do 
you entertain? 


Q2: What evidence can you muster to support your posi- 
tion? 


Q3: What is the significance of the specialist’s observation 
of erythema of the vocal folds? 


Q4: What significance do you give to the observation that 
his voice sounded “wet” on vowel prolongation? 


Q5: What interpretation do you give to the observation 
of a rapid tremor or “flutter” during vowel prolonga- 
tion? 


Q6: How would you progress with the examination and the 
management of this patient from this point? What is 
your rationale? 


The patient’s cough sounded reasonably normal. How- 
ever, his glottal attack on a vowel seemed to be produced 
with less than normal percussion. 


Q7: If valid, what interpretation would ordinarily be given 
to a less than normal glottal attack on a vowel? 


Q8: Why would the cough be sharper than the glottal 
attack on a vowel? 


You decide to perform an oral-physical examination. 
With a tongue depressor, you stroke the patient’s lips 
from the angles of the mouth toward the center and think 
you see a slight pursing of his lips. Upon opening his 
mouth, you find that his mouth glistens with a coating of 
saliva, and you ask if he has noticed any wetness of his pil- 
low upon arising in the morning. Upon protruding his 
tongue approximately one-half inch beyond his incisor 
teeth and resting it there, you think you see an irregular 
surface of his tongue along both sides. Every now and 
then you see dimpling movements in different areas of 
the surface of the tongue along the sides. 


Q9: What interpretation do you give to the wetness on the 
surface of the mouth and tongue? 


Q10: What is the significance of the question about the wet 
pillow in the morning? 


Q11: What is the meaning of the irregular tongue surface 
along the sides? 





Q12: Of what relevance are the dimpling movements scat- 
tered over the edges of the tongue? 


During the interview, you note further that two or three 
times when you touched on personal issues, the patient 
seemed to be on the verge of crying. When asked about 
it he said, almost puzzled by his own loss of control, that 
he had been doing that quite often during the past 2 
months, sometimes with only the slightest provocation. 
You note also as he speaks that in addition to the hoarse- 
ness, his prosody is somewhat lacking in pitch variations. 


Q13: What thoughts come to mind about the patient’s ten- 
dency to cry easily? 


Q14: Does the observation of a tendency toward monopitch 
contribute anything to your differential diagnostic 
thinking? 


Continuing with a routine motor-speech examination, 
you are further alerted to the possibility that on testing 
alternate motion rate for /pa/, [ta], and /ka/, his ability to 
repeat these syllables may not be quite within the normal 
speed range, something that did not appear abnormal 
during contextual speech. You decide to measure his rate 
and discover that for the /pa/, his maximum rate is 5 Hz; 
for /ta/, 4 Hz; and for /ka/, his rate fluctuates between 3 
and 4 Hz. 


Q15: What significance do you give to the alternate motion 
rates obtained? 


Further background information reveals that the 
patient has been having difficulty lately swallowing liq- 
uids, tending to choke occasionally unless he drinks 
slowly. His gag reflex, on your continued motor-speech 
examination, is quite hyperreflexive, and, yet, even 
though you do not hear any hypernasality, he has mild 
nasal emission producing moisture on a mirrored surface, 
confirmed with nasal airflow studies. 


Q16: What is the potential significance of the patient’s 
occasional tendency to choke on fluids? 


Q17: In view of the hyperactive gag reflex, of what signifi- 
cance, if any, is the mild degree of nasal emission 
observed? 


Q18: What diagnostic possibilities are you entertaining con- 
cerning this patient’s voice disorder? 


Q19: What decisions would you make concerning further 
procedures to be taken with this patient? 


Discussion 


The case depicted here is not unusual. A patient comes 
with a focal voice disorder in which most or all attention 
by various clinicians has been directed toward the larynx, 
obscuring other deviations of speech, which admittedly 
can be borderline and appear normal until scrutinized 
carefully. Such was the situation with this patient who, 
because of no mass lesions and because of symmetric 
movements of the vocal folds, was considered to be free 
of organic disease. The inflammation noted by the oto- 
laryngologist is common in patients, and especially in 


those who have a tendency toward vocal misuse or who 
may have esophageal reflux; in this case, it is considered 
to be insignificant. It would have been reasonable for the 
average clinician to attribute the strained hoarse dyspho- 
nia to vocal misuse in a teacher under stress in his job, but 
concurrent events are not necessarily causally related, 
once again, highlighting the necessity of looking carefully 
at the whole picture so as not to identify a correlational or 
contributing factor as the causal factor. 

Piecing together, the evolving historical and examina- 
tion findings the clinician should have become increas- 
ingly suspicious that there was more going on with this 
patient’s speech mechanism than a phonatory disorder. 
In retrospect, we see a picture forming of a man develop- 
ing emotional lability, dysphagia, reduced ability to swal- 
low accumulating saliva, possible tongue atrophy and 
fasciculations, velopharyngeal insufficiency (which can 
occur even in the presence of an active gag reflex), a posi- 
tive sucking reflex, and borderline slow reciprocating 
muscle movements as demonstrated on alternate motion 
rates. 

The differential diagnostic choices in this patient are 
multiple (Fig. 10.12). He may be misusing his voice, he 
may be compensating for an underlying laryngeal disor- 
der, he may be exhibiting a psychogenic voice disorder, or 
the signs and symptoms may reflect very early spastic 
(pseudobulbar) dysarthria-dysphonia having a major 
laryngeal-phonatory expression. 

The decision at the termination of the speech patholo- 
gist’s examination ought to have been, and was, to refer 
the patient for a neurologic examination, whereupon 
signs of spasticity were found elsewhere in the body, as 
was evidence of muscle denervation on electromyo- 
graphic examination, implicating the lower motor neuron 
system in addition to the upper motor neuron pyramidal 
system. The neurologist’s diagnosis was motor neuron 
disease, probably amyotrophic lateral sclerosis. 
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Strained Hoarse Voice Arrest-Intermittent 
Arrhythmic Group (SHVA-IA) 


Case Study 10.6 


A 32-year-old woman enters your office. Three chairs 
line the wall, each extending further from your desk. She 
sits on the one farthest from you, piling her coat, purse, 
and paperback book on her lap. She smiles, sits erectly, 
and leans away from you (Fig. 10.13). 


Q1: What early inferences might be made about this per- 
son based on the information given? 


Q2: Which statement to the patient do you think would be 
the most appropriate to make at this juncture in the 
examination?: 


A. “I see you’ve been reading the latest bestseller. I’ve 
been wanting to get it myself. What do you think of it?” 


B. “Your doctor sent me a report about your voice dis- 
order. From his letter, it looks as if you’ve been 
abusing your voice quite a bit lately.” 


C. “You might be more comfortable if you put your 
belongings on that chair and sat next to my desk. I 
think we'll be able to converse more easily that way.” 

Clinician (C): What brings you to the clinic? 

Patient (P): My voice. It’s been like this for 3 months, and 
it’s like about to drive me up the wall. 

C: Could you describe it? What’s different about it now 
compared with the way it used to be? 

P: Gee, that’s going to be hard . . . Well, | guess it’s kind 
of hoarse. Well, not exactly hoarse; it’s different from the 
hoarseness | usually get from laryngitis. It’s, uh, kind of 
strained or raspy; like I’m pushing or pulling everything 
from way down here. (She lays the flat of her hand on her 
abdomen.) It’s very tiring. Sometimes it gets so bad | just 
don’t talk. 


Adductor 
Spastic 
Dysphonia 
(Psychogenic) 


hoarse continuous group (SH-C). 


Figure 10.12 Flow chart of the multiple 
differential diagnostic choices of Strained 
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C: How did this voice trouble start? 

P: It actually began with a cold and sore throat. (Laughs 
wryly.) Then, the cold went away, but the hoarseness did- 
n’t. My neck feels sore, too. (She runs her hand from her 
chest to under her chin.) And, | get this pain in my ears. 
The doctor looked but he couldn’t find anything wrong 
with my ears or my lymph nodes. 


Q3: Why didn’t the clinician launch directly into a voice 
evaluation instead of conducting a preliminary discus- 
sion? 


Q4: Even though you have not performed a formal voice 
evaluation, what might you deduce about your 
patient’s vocal fold activity from her description of her 
voice? 


Q5: What do you think is the physical basis for her com- 
plaints about increased efforts to talk? 


Q6: What do you make of her statement that her voice 
trouble persisted? 


Q7: What might be the significance of her complaints of 
neck and ear pain? 


C: So, if I have this correctly, your voice trouble began 
about 3 months ago with a cold and persisted after the 
cold cleared. | want to go into the history of the voice 
disorder in greater detail, but before we do, | want to 
examine thoroughly your speech mechanism and 
voice. 

C: First, read this paragraph aloud while | tape record 
your speech. 

P: (She reads the “Grandfather Passage” aloud.) 

C: Now | want you to sustain the vowel /a/ as long and 
steadily as you can. Do that 3 times. 

P: (As she reads the passage you hear irregular, fluctu- 
ating moments of strained hoarseness and slight voice 
stoppages. On vowel prolongation, her voice is also 
strained but not as much as during oral reading. You hear 
moments of clearer voice, which is even more apparent at 
higher pitch levels.) 

C: All right, now will you cough as sharply as you can? 





Adductor Figure 10.13 Differential diagnosis 
Spastic schema for Strained hoarse voice arrest - 
Dysphonia intermittent arrhythmic group (SHVA-IA) 
(Neurologic) illustrating steps in detecting cause of 
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P: (She does so with a sharp, well-defined glottal attack.) 

C: What we want to do now is to examine your speech 
mechanism. I’d like you to sit up straight and rest your 
hands in your lap. Look straight ahead. 

P: (The clinician observes her during complete rest for 
several seconds.) 

C: Now, open your mouth, but keep your tongue at rest 
on the floor of the mouth. 

P: (The tongue is of normal size and shape and is sym- 
metric.) 

C: Now, gently rest the tip of your tongue over your 
lower teeth. 

P: (The tongue surface remains quiescent.) 

C: Stick your tongue straight out of your mouth. 

P: (She does so symmetrically.) 

C: Wiggle it from side to side as fast as you can. 

P: (These movements are performed rapidly with good 
range of motion.) 

C: Put your tongue in your left cheek and don’t let my 
finger press it in. 

P: (This is done in the right cheek as well. Her tongue 
resists pressure well.) 

C: Press your tongue forward against the tongue 
depressor as | push in. 

P: (She does this with good strength.) 

C: Lift the tip of your tongue and touch your upper lip. 

P: (This is performed normally.) 

C: Hold your mouth open quietly again while | look at 
your soft palate. 

P: (It hangs quietly and symmetrically.) 

C: lm going to take this tongue depressor and stroke 
the sides of your soft palate and pharynx. 

P: (She gags easily and both sides of the soft palate rise 
symmetrically and with good medial pharyngeal wall 
movement.) 

C: Open and close your mouth. 

P: (She does so without any deviation of the mandible.) 


Q8: What words might you use to describe the patient’s 
voice signs? 


Q9: What kinds of information did you expect to obtain 
from asking her to read a standard paragraph aloud? 


Q10: What kinds of information did you expect to obtain 
from asking her to sustain a vowel sound as long as 
she could? 


Q11: What is the significance of her clearer voice at a higher 
pitch level? 


Q12: What is the purpose of asking her to cough as sharply 
as possible? 


Q13: In light of the fact that she has been referred to you 
with a voice disorder, for what specific reasons did you 
perform a complete oral-physical examination? 


Q14: What was the purpose of asking her to sit as quietly as 
possible while you observed her head, neck, and other 
bodily structures? 


C: If you will look straight ahead, now, I’m going to feel 
your neck. 

P: (Her larynx is rigid; it is difficult to move from side to 
side. Her thyroid cartilage is pulled up high in her neck, 
and her thyrohyoid space is narrow. You encircle the hyoid 
bone with the thumb of your left hand over the tip of the 
left major horn of her hyoid bone and the middle finger 
over the right, and pressing inward, rotate your fingers 
over this area.) “Ow. Do you have to do that?” (Then, with 
the thumb and middle finger over the superior borders of 
the alae of the thyroid cartilage and your index finger in 
the thyroid notch, you try to maneuver the larynx down- 
ward, away from the hyoid bone. As you are doing this, 
you ask the following.) 

C: How does that feel? 

P: Oh, it hurts. It really does. (As she says this, you hear 
a sudden clearing of her voice. She looks at you with sur- 
prise.) Say, that sounded different (her voice returning to 
its strained hoarseness). 


Q15: Whatis your interpretation of the elevated larynx, laryn- 
geal rigidity, and diminished thyrohyoid space? 


Q16: Name the muscles most likely to be involved in these 
physical findings. 

Q17: Why did she respond with pain when you pressed your 
fingers and rotated them in the hyoid region? 


Q18: How does this information contribute to your under- 
standing of the etiologic diagnosis of her voice disor- 
der? 


Q19: What is the significance of the momentary clearing of 
her voice as you pulled downward on her larynx? 


C: Let’s continue with the history of your voice trouble. 
Tell me, from the time it began, is it getting better, worse, 
or staying the same? 

P: It fluctuates. Sometimes it’s better, sometimes 
worse. 

C: How about time of day; does that affect it? 

P: Yes. It’s usually better in the morning. In fact, there 
are times when | first get up in the morning it’s normal, 
and that may last for 15 to 20 minutes. 

C: Have there been other times that the voice has 
returned to normal? 
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P: Yes. About a month ago it was normal almost the 
whole time | was ona trip alone to San Francisco, but it got 
worse again after | came home. 

C: So, your voice disorder is not a steady thing; it fluctu- 
ates with time of day and where you are. Your doctor 
wrote in his referral letter that you are in TV production. 
Tell me about that. 

P: lm an assistant manager. 

C: How long have you been in that kind of work? 

P: About 10 years. I joined the station right out of college. 

C: Do you use your voice much in your work? 

P: (She smiles and angles her head toward you for 
emphasis.) You bet I do. I’m on the phone all day talking to 
salesmen, technicians, personnel. Some days by late 
afternoon | can barely talk. 

C: And, have you always had to use your voice a lot in 
your work? 

P: Yes. It’s been that way from the beginning when | 
took the job. 


Q20: What is your reaction to the information that the 
patient’s voice fluctuates depending upon situations 
and time of day? 


Q21: What is your reaction to the information that at times 
the voice has returned to normal for several minutes to 
several days? 


Q22: What conclusions do you draw about the possible eti- 
ology of her voice disorder from the information that 
she uses her voice far more than the average person in 
her daily work? 


Q23: With the information you have at this point, are you 
satisfied that you understand the etiology of her voice 
disorder and are ready to begin therapy? If so, what 
procedures would you next initiate? 


Q24: If you are not satisfied that you have the entire picture 
of her voice problem, what further information would 
you wish to obtain? 


C: Let’s stop at this point and review what we have so 
far. Did your doctor explain the results of his laryngologic 
examination? 

P: Yes. He said my vocal folds were normal except for 
some slight reddening that couldn’t account for my 
abnormal voice. He said the redness could have come 
from a cold or it could be due to irritation from abusing 
my voice. 

C: Good. Because | want you to understand that basi- 
cally there is nothing physically wrong with your vocal 
folds. There are no growths, infections, or paralyses that 
might account for your voice trouble. 

P: Well, I’m certainly glad to hear that. You hear so 
much about cancer and things like that. | was really wor- 
ried .. . But, then why does my voice sound this way? Are 
you going to tell me this is all in my head? 

C: That’s a good question, because, here you have this 
very abnormal-sounding voice and we're telling you 
there’s nothing physically wrong with the voice mecha- 
nism. 

P: Yes. And, that feeling of stiffness and pain—if that’s 
not physical, what could it be? 
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C: | appreciate how confusing this must be, so, | want to 
introduce an idea to you that, for the time being, might 
help to explain your complaints. A person can have con- 
siderable changes in voice along with real feelings of stiff- 
ness and pain because of excess contraction of the throat 
muscles. Most people have had similar tension and painin 
other parts of the body. When such tensions occur in the 
throat muscles, they can cause changes in the voice along 
with these abnormal feelings. 

P: All right. So, what causes the tensions? | think I’m 
pretty well adjusted. (Slight note of irritation in her man- 
ner at this point.) 

C: Let’s talk about that, if we may. Specifically, was 
there anything unusual in the way of personal problems 
when you started having trouble with your voice? 

P: Not that | can think of. | was working as usual. Every- 
thing is going real well at the job. In fact, | just got a pro- 
motion and a raise. No, there’s nothing. (As the patient 
tells you this, she averts her gaze and frowns, looking 
down at her lap. There is a long pause.) 


Q25: Which of the following steps would you take at this 
point? 
A. Accept her denial of problems and continue on to 
another subject. 


B. Point out to her that she seems upset by some- 
thing. 

C. Discontinue the examination and make another 
appointment. 


D. Wait and say nothing. 


(She looks up at you, smiles and shrugs her shoulders as 
if to say: “That’s it; nothing else comes to mind.”) 

C: This is a difficult question for most people. Some- 
times we are unaware that certain things are bothering us. 

P: Well, like what kinds of problems are you referring 
to? | mean, uh, | guess maybe there has been something 
bothering me, but | didn’t think it would have anything to 
do with my voice. (Pauses) Now that you mention it, | have 
been unhappy in a situation that | haven’t done anything 
about. Normally, when I’m unhappy in a situation, | 
change it. (Pause) The marriage. 

C: Would you mind telling me more about that? 

P: Well, he is very... he doesn’t want to accept the fact 
that we don’t have a marriage, and it’s very difficult for 
him. He doesn’t understand a lot of things about it which 
| understand. I’ve never been a nervous person or shaky, 
you know, about anything, and just since we’ve been mar- 
ried, I’ve been fighting divorce or separation. For my hus- 
band’s sake, | want to stay married. 

C: What, in your opinion, is wrong with the marriage? 

P: | guess the fact that | don’t like closeness. | loved 
being single. My favorite things are to ride my horse out in 
the woods for days or read books. Of course, | love being 
with people also. | am an executive, so | have to be with 
people. But, marriage isn’t anything like | thought it would 
be. 

C: What did you think it was going to be like? 

P: Well, | thought it would be a great deal of sharing 
emotionally and materialistically—all the things that you 





share—but | didn’t think it would be sharing everything, 
meaning the husband; he doesn’t want to do anything 
unless | do it with him. To me, it has been very draining, 
because | never in my life, even when | lived with a big fam- 
ily, had all that closeness. I’ve never had that much close- 
ness in that way; to me, it’s like 24 hours of . . . (pauses, 
searches for the right word). 

C: Being stifled? 

P: Yes. And I’ve told him that and explained it to him 
and he doesn’t do it purposely. | think it’s the first time 
he’s ever been in love, and he’s gotten carried away with 
it. And the difference being he’s a person that needs me 
and | don’t need him. | don’t need him in the way, | mean, 
to fulfill my life, and that is a big difference in our relation- 
ship. 

C: Did you have trouble confronting your husband with 
your feelings about the marriage? 

P: Not at all, In fact, before we got married, | was really 
fighting getting married. 

C: Why did you? 

P: Iwish | knew, because it just happened real suddenly, 
and my husband is very persuasive, and I’ve always really 
cared for him, too. (Long pause) 

C: How and when did you meet him? 

P: We were attending the same advertising conven- 
tion. That was about 9 months ago. He’s older, 52, and 
has made a lot of money and had been recently divorced. 
It was a sort of a whirlwind courtship, at least for me. He 
sort of pursued me during the whole convention. We 
started going out together. After that, he was very insis- 
tent; said he needed me desperately. We were married 
after 3 months. 

C: So, that was about 6 months ago, and your voice 
trouble started 3 months ago. But, in spite of your feelings 
about wanting to remain single, you went ahead with the 
marriage anyway. Did you express your misgivings to your 
husband before you were married? 

P: Very much. I told him | was not a touchy-feely per- 
son. | mean, I’m loving, and | have a lot of love to give, and 
| can accept love, but he is starved for love in the way that 
he really needs constant patting on the head and a lot of 
attention, and a lot of what | can’t, | guess, give. And, | 
don’t know if there is any reason for him to lack that just 
because of me. 

C: Do you and your husband have much in common? 

P: Oh gosh! We have everything in common. We are 
both very adventuresome; we love to travel, camp, ski. 

C: Do you think your voice is better right now than 
when we started? 

P: Much better. 

C: I’m glad you recognize that. It’s 50 to 75% improved 
since we began discussing this marital situation. Do you 
feel better about having talked about that? 

P: Yes. Much better. Up until now, | kept rejecting talk- 
ing to anybody professional about it. | kept thinking, well, 
whatever it is it will go away tomorrow, because tomor- 
row would be fine. | got to the point where it was so frus- 
trating not being able to converse and express or even go 
to a counter and order a bottle of perfume without some- 
body asking, “What are you saying?” This confirms my 


feeling that my voice has something to do with keeping 
things in. In other words, I’ve always been a happy person, 
and I’m always jovial and never depressed, and just lately 
I’ve been in a state of depression. 

C: Can you estimate for how long? 

P: Well, l’d say about 3 or 4 months. 

C: That’s about the time you’ve had your voice problem. 

P: | feel like it has. Because | feel like it, in other words, as 
| said earlier, I’ve never been a nervous person. Well, lately 
I’ve been very, very tense. | lay down to go to sleep and try 
to sleep and then just start thinking. 

C: Do you think our discussion has been helpful? 

P: I’m sure it has. Before, | thought this voice thing was 
physical, but | realize now | don’t have something physi- 
cal. | feel more comfortable about that. | also feel more 
comfortable because until now, | hadn’t talked to any- 
body at length about this. 


Q26: What would you say next to the patient? Why? 


C: How do you think we should proceed from here? 

P: (She becomes silent for almost a minute, thinking 
deeply.) Obviously, my voice is better. Sounds almost nor- 
mal, now. | came to have my voice helped, but here | am 
with an entirely different problem. That | didn’t expect. 

C: That’s something you’re going to have to deal with, 
isn’t it? What do you want to do? 

P: | think a separation is worth a try; be away from each 
other for 6 months or so and for me to seek whatever ther- 
apy | need during that 6 months. 

C: Therapy? What did you have in mind? 

P: Counseling, marital counseling | suppose. 

C: Do you know of anyone locally who could help? Or, 
would you prefer that we arrange for you to see someone? 

P: No, | know someone who’s really good, has a good 
reputation. May | ask that you send a report of your exam- 
ination to her? 

C: Of course. And, I'll send a full report to your laryngol- 
ogist within the next few days. 

P: But, do you really think all this has caused my voice 
trouble? 

C: What do you think? 

P: (She smiles.) | think it has. 

C: l'Il say goodbye, then. Please keep in touch and let 
me know how things turn out? 

P: | will. | promise. 


Q27: Is there a relationship between her marital problem 
and her voice disorder? Or, are they just coincidental? 
What precipitating factors might have contributed to 
the problem? 

Q28: How do you think the clinician in this case handled the 
situation? 

Q29: Do you think the patient was in need of psychiatric 
help? Explain. 

Q30: What formal diagnosis would you give to her voice dis- 
order? Why? 

Q31: What do you think is the prognosis for her voice disor- 
der and her relationship with her husband? Why? 

Q32: Should the patient have been given conventional voice 
therapy? 
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The patient wrote the following letter to the speech- 
language pathologist: 


It has been 2 months since | came to see you about my 
voice disorder, and | hope you will recall the circum- 
stances surrounding the disorder. As we had decided 
then, | went to see a psychiatrist in my home town who 
specializes in marital problems. Both my husband and 
myself had several sessions with her, and after getting 
our differences into the open, which took several con- 
sultations, we decided, with the approval of the psychi- 
atrist, to separate, This happened 3 weeks ago, and 
while my voice had remained improved after having 
seen you, it continued to do so after our separation and 
as | write my voice is completely normal. 

| would like to thank you for all your help and for 
enabling me to see what was bothering me. 


Instructions 
1. Write a clinical report of your examination of this patient. 


2. Write a letter to the referring otolaryngologist. 


Sample Report of Clinical Voice Examination Mrs.___ is 
a 32-year-old television station assistant manager who was 
referred by Dr.____ , laryngologist, with a diagnosis of a 
“voice disorder secondary to vocal abuse.” Based on his 
laryngoscopic examination, except for some evidence of 
vocal fold irritation, the vocal folds were essentially normal 
in structure and movement. 

The patient began noticing signs of strained voice quality 
approximately 3 months after her marriage. It has persisted 
since then, although it fluctuates, depending upon time of 
day, fatigue, and whether or not she happens to be experi- 
encing an optimistic or pessimistic outlook. On one occasion, 
the voice was completely normal while she was alone on 
vacation. 

Oral-physical examination failed to disclose any evidence 
of facial, lingual, velar, or mandibular weakness or asymme- 
try. She gave no evidence of unusual movements about her 
face or other parts of her head, neck, or extremities. Articula- 
tion, both during contextual speech and on alternate motion 
rate, failed to disclose any evidence of a motor speech disor- 
der. 

The physical history indicates that although 
Mrs. __ has been under considerable pressure at 
work using her voice a great deal, this situation has existed 
for many more years than the short duration of her current 
voice disorder. After initial reluctance, she disclosed that she 
was dissatisfied with her marriage owing to its confining 
nature. As the patient disclosed this information her voice 
gradually improved. She recognized the improvement and 
understood the relationship between her voice disorder and 
her martial dissatisfaction. 

We discussed the advisability of her obtaining counseling 
in order to help her resolve her marital problem. Sympto- 
matic therapy of the voice disorder would be ill-timed in 
view of her emotional conflict and its priority over her 
abnormal voice. 
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We decided that the patient would seek professional help 
for her marital problem. She will keep us informed as to her 
progress. 


Sample Letter to the Otolaryngologist Thank you for your 
kind referral of Mrs._ . I have enclosed the report 
of my clinical speech evaluation. While I think that vocal 
abuse might have exacerbated her voice disorder, it was my 
impression, as noted in the enclosed report of clinical exami- 
nation, that the fundamental cause of her dysphonia has been 
psychologic and related to an unresolved marital problem. I 
think you would be gratified to learn that her dysphonia 
improved significantly during her disclosure of her dissatis- 
factions, and that she recognized the relationship between 
her voice disorder and these tensions. My impression was 
that symptomatic voice therapy would have been premature 
at this point inasmuch as her entire attitude shifted from 
attention to her voice to that of her martial dissatisfaction. 

I offered the patient the names of psychiatrists and clinical 
psychologists in her local area, but she elected to see some- 
one of her acquaintance with an excellent reputation. | will 
keep you informed as to her progress. I suggested that she 
contact you at some date in the future to let you know how 
she is doing. 

If there is any further information that you would like to 
have, please do not hesitate to call or write. 


Discussion 


It is usually a good idea to begin with general conversation 
rather than going directly into the mechanics of the voice 
examination. To do so is not only a sign of personal interest in 
the patient but also gives the clinician an impression of the 
patient’s total communication in addition to the voice (i.e., 
resonation, articulation, language, ideation, and affect). 

From the transcript, we read the description “strained or 
effortful.” Although it is difficult to know what the voice was 
like without hearing it, the term strained ought to have con- 
jured up hyperadduction of either the true vocal folds, false 
vocal folds, or both. Complaints about increased effort to 
talk along with strained hoarseness is also evidence of vocal 
fold hyperadduction. The increased physical effort indicates 
that the patient had to exert abnormally intense thoracic 
and abdominal muscle contraction in order to force air 
through the hyperadducted vocal folds. The patient’s com- 
plaint of neck and ear pain is often a sign of chronic excess 
intrinsic and extrinsic laryngeal muscle contraction. 

The statement that the voice disorder began with an 
upper respiratory infection that disappeared whereas the 
voice disorder persisted is not an uncommon finding asso- 
ciated with the onset of voice disorders. It is often consid- 
ered to be indicative of psychogenic dysphonia, as a high 
percentage of psychogenic voice disorders begin in this 
way. At the onset of the upper respiratory infection (URI), 
there may have been some laryngeal edema for which the 
patient needed to compensate. Whether there are sec- 
ondary gains of extra attention or reduced workload from 
having laryngitis or simply that the extraneous muscle 
effort was maintained to the detriment of voice or some 
combination of the two is part of what the clinician 
attempts to determine. 


Laryngeal elevation, rigidity, and diminished thyrohy- 
oid space are usually signs of chronic hypercontraction of 
the extrinsic, and probably the intrinsic, laryngeal mus- 
cles and are associated with musculoskeletal tension dis- 
orders. The muscle groups most likely involved in such 
laryngeal malpositioning are the suprahyoids and infrahy- 
oids. Pain when the tissues around the hyoid bone and 
thyroid cartilage are kneaded supports the inference of 
excess musculoskeletal tension. (Many clinicians are 
reluctant to touch the patient, let alone cause pain. Yet, it 
is essential to determine if, in fact, pain and tension exist if 
a proper etiologic diagnosis is to be made.) 

Momentary clearing of the voice as the larynx was 
pulled downward reinforces the suspicion of a muscu- 
loskeletal tension voice disorder. That the patient’s voice 
fluctuated depending upon situation and time of day fur- 
ther supports psychogenicity; however, it must be 
strongly emphasized that organic voice disorders also 
vary in this manner, although they rarely return to normal. 
That her voice returned to normal, not just improved, 
from several minutes to several days, almost certainly 
indicates that the entire voice disorder is psychogenic. 

From the written descriptions of the voice, such words as 
spastic, spasmodic, hyperfunctional, or strained hoarseness 
come to mind. Asking the patient to read a standard para- 
graph aloud not only provides information about how the 
dysphonia fluctuates with the production of phonemes in 
different contexts, but it also reveals other aspects of motor 
speech. Asking the patient to sustain a vowel provides 
information about the quality, pitch level, and steadiness of 
the voice. During such steady-state activity, the clinician 
can listen carefully for subtle voice aberrations and for fluc- 
tuations in laryngeal-respiratory control. Also, in certain 
voice disorders, vowel prolongation can produce a very dif- 
ferent voice than during contextual speech, which may 
have important differential diagnostic significance. 

The fact that the patient’s voice was clearer at higher 
pitch levels should have led the clinician in the direction of 
considering the voice disorder to be one type of adductor 
spasmodic dysphonia, in which there is strained voice at 
conversational pitch levels but clear voice at higher ones. 

Asking the patient to cough or produce a sharp glottal 
attack provides important information about her ability 
to adduct her vocal folds adequately. In patients who have 
organic weakness, the cough and glottal coup lack sharp- 
ness. A complete oral-physical examination is always 
important, even when the clinician suspects that the dis- 
order is confined to the larynx, because motor-speech dis- 
orders can begin with a dysphonia, predominately, while 
very careful examination of remaining motor-speech 
structures may disclose the beginnings of a dysarthria in 
other regions of the speech musculature. 

Asking the patient to keep her mouth, tongue, and jaw 
at rest while the clinician observes her head and neck 
structures is important to determine the presence of sub- 
tle neurologic movement disorders that have a tendency 
to show themselves only when the patient is quiescent. 

Three factors suggested there was a definite link 
between the patient’s marital problem and her voice 
disorder. Only during her marriage did she develop the 


voice disorder for the first time in her life. Her voice 
became normal while she was away from her husband. 
Her dysphonia improved during the interviews as she 
described the conflicts created by her marriage. There 
was little question that the patient needed counseling. 
She was in a serious conflictual situation that was already 
having somatic repercussions, and it was apparent that 
she was unable to manage this situation on her own. 

Based on the intermittent strained voice quality and 
associated laryngospasm, the history, and the changes in 
voice, it would be appropriate to conclude that this 
patient had adductor spasmodic dysphonia, psychogenic, 
probably of the conversion-reaction type. 


Case Study 10.7 


A 43-year-old married woman with grown children who 
worked for a real estate firm came to the clinic for help 
with a voice problem of 1-year duration. She describes her 
voice as “tense, strained, and high in pitch, and some 
words won’t come out at all.” Her disorder began gradu- 
ally with a nondescript hoarseness that continued to dete- 
riorate to its current strained quality. It had improved on 
occasion; she had been receiving voice therapy “to pro- 
duce a more relaxed and clearer voice.” Whereas she 
found that she could, in fact, produce a clearer voice with 
concentration, these techniques did not hold up in most 
speaking situations outside of the clinic. She volunteers 
that her voice disorder has interfered with her supervisory 
responsibilities at work and has been equally damaging to 
her social life. This most important ability had gone out of 
her life, she said, and at fleeting moments during the past 
few months, she has had suicidal thoughts. 

Lately, she has been complaining of “a tight neck and 
shoulder,” and that especially when driving, she has 
noticed a tremor of her head. At about the same time, she 
began to notice a tremor in her voice, but this was prior to 
its worsening. 

Voice examination revealed a pronounced strained- 
strangled voice, but vowel prolongation was clear and 
steady, except at a higher pitch level where mild tremor 
was detected. During the oral-physical examination, she 
had intermittent head tremor and a tendency for her head 
to turn to her right side. 

The psychosocial history was nonproductive. She was 
not under any particular stress or anxiety at work, and her 
marital and other domestic relationships, including those 
with her children, were pleasant and free of all but the 
usual day-to-day variations in stress. 

The ear, nose, and throat examination report came 
back with the notation, “normal laryngoscopic examina- 
tion; functional dysphonia.” 


Q1: From this description of the patient’s voice, what ter- 
minology would you use to describe the voice signs 
and symptoms? 


Q2: What muscular actions within the larynx and sur- 
rounding structures could be producing this kind of 
voice? 
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Q3: Whatsignificance, if any, would you give to the patient’s 
complaint of turning of her head and head tremor? 


Q4: What are your thoughts, your reactions to the exami- 
nation report of a normal larynx and that the disorder 
was “functional”? 


Q5: With the information that you have in hand, what next 
steps would you take in the management of this patient? 


Because of her complaint of mild turning of her head 
and head tremor, a neurologic examination was sched- 
uled. After a complete examination, the neurologist sum- 
marized his findings as follows: “For the past 2 years, the 
patient has been aware of a sense of aching in the back of 
her neck in the region of the left trapezius muscle. A few 
months after the onset she began to be aware of a ten- 
dency for her face to turn to the left and the vertex of her 
head to the right, which she can voluntarily straighten 
with her left hand. No one else observed these move- 
ments owing to their mildness. She is most aware of this 
movement while driving a car and watching television or a 
movie. Also a few months after the onset of aching in her 
neck, she became aware of a horizontal head tremor, 
intermittently, most noticeable when she was in a motor 
vehicle. A few months after that she began to detect a 
tremor in her voice. Laughing or yawning transiently cor- 
rected the aberrant head posturing. Because of her voice 
disorder, her future career is at stake, and if something is 
not done about her voice she may lose her position in her 
firm. She admits to a reactive depression to her voice that 
has also caused her to become withdrawn. In addition to 
the voice and head and neck findings, it is also important 
to note that she holds a writing instrument tightly and 
comments that, within the past year, her penmanship has 
declined. The remainder of the examination is normal. 
Impression: The neurologic examination gives evidence 
of extremely mild, but observable, spasmodic torticollis 
and associated head tremor.” 


Q6: What is spasmodic torticollis? 


Q7: What is the significance of the head tremor in associa- 
tion with spasmodic torticollis? 


Q8: What is the connection, if any, between the spasmodic 
torticollis, head tremor, and the patient’s voice disor- 
der? 


Q9: What is the significance of the change in her writing? 


Conference with both the laryngologist and the neurolo- 
gist discloses that there is no therapy that could be suggested 
from the standpoint of their specialties; that there were some 
drugs that have been used to treat a variety of movement dis- 
orders but that none had shown definite promise. 


Q10: What steps would you now initiate with this patient in 
order to help her with her voice disorder? 


Q11: The patient is now back in your office and it is your 
responsibility to summarize the results of everyone’s 
examinations and to counsel the patient further per- 
taining to future therapy, if any. What would you tell 
the patient as to the nature of her voice disorder, its 
cause, and the options for treatment? 
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The patient was told that she had a disorder that has 
gone by the name of adductor spasmodic dysphonia, and 
that in some cases the disorder is psychologic and in some 
cases it is neurologic. In her case—in addition to the fact 
that previous psychiatric and psychologic studies had failed 
to show any significant psychopathology, and that the neu- 
rologic examination had disclosed mild but positive signs 
of a movement disorder—in all probability (although no 
one can say with complete certainty), this was an adductor 
spastic dysphonia of the dystonic type involving both torti- 
collis and tremor. Because previous therapy performed by a 
competent speech pathologist had failed to produce an 
enduring improvement in her voice, the patient was told 
the only remaining therapy option available at that time 
was the recurrent laryngeal nerve resection procedure. 

The patient, it turned out, had studied this disorder and 
its causes and she knew about the surgery for it. She 
expressed skepticism owing to the conflicting reports about 
its effectiveness, especially in the long-term, and was con- 
cerned about the kind of voice quality that would result 
from the surgery. On the other hand, she said that she was 
thoroughly “fed up” with the voice, that she had to do some- 
thing, and that she was willing to risk failure if there was any 
chance the surgery might prove to be of benefit for her. 

After evaluating videotapes of previous patients who 
had undergone the surgery, the patient decided to pro- 
ceed with the lidocaine injection procedure. 


Q12: What is the purpose of the lidocaine injection? Explain, 
physiologically, its effects on the patient’s vocal folds, 
the kind of voice that will result from the injection, 
what it can predict about the effects of surgery, and 
indications and contraindications to surgery based on 
the injection procedure. 


Sitting upright in the laryngologist’s examining chair, a 
needle filled with 1% lidocaine was injected in the region 
of the cricothyroid space on the right side of the neck. 


Q13: Why was the injection made at this particular 
anatomic site? 


Q14: How soon after the injection should the voice change? 
Q15: How long should the effects of the injection last? 


Q16: How can you tell if the injection has paralyzed the 
vocal folds? 


Q17: What kinds of voice testing material should be used to 
enable patients and clinicians to evaluate the effects of 
the injection? 


It required four injections before the laryngologist was 
able to hit the nerve and the voice became smooth, 
slightly breathy, and completely free of spasm. The 
patient expressed immediate gratification at these voice 
and physical changes and said that she wished to proceed 
with surgery without further need for contemplation. The 
patient underwent a preanesthesia medical examination 
during the afternoon of that day, entered the hospital at 
suppertime, and went into surgery at 7:30 the next morn- 
ing. The patient was anesthetized, a primary, collar incision 
was made at the base of her neck, and the right recur 
rent laryngeal nerve was identified. After certainty of 





identification by mechanical stimulation of the presumed 
nerve and looking for momentary vocal fold movement 
via direct laryngoscopy, a section of the nerve was taken 
and the neck was closed. The pathologist’s report identi- 
fied the specimen as nerve tissue. 

During the next few hours, the patient became increas- 
ingly alert, remained in her hospital room that night, and 
was wheeled to the speech pathologist’s office the next 
morning for a 24-hour postsurgery evaluation. 


Q18: What would you expect the patient’s voice to sound 
like at this postsurgical period? 


Q19: What would you tell the patient when she asks 
whether this is the way the voice is going to be or 
whether it will change? 


Q20: What advice should be given to the patient at this 
point as to how to use the voice? 


The 24-hour postsurgical voice was breathy, and each 
time the patient increased the intensity of her voice, it 
became raspy and contained falsetto pitch breaks. She 
expressed satisfaction, at this point, mostly with the 
increased ease of voice production. She looked calm, her 
facial expression having changed from one of strain to 
one of relaxation. Her voice was recorded. 

The next morning, 48 hours after surgery, her voice had 
lost some of the breathiness and was louder. She looked 
less groggy and continued to express approval of the 
sound and feel of the voice. Her voice was recorded again 
and she returned to her hospital room. 

On the third morning, 72 hours after surgery, her voice 
had continued to improve in clarity and loudness. She had 
just been discharged from the hospital and was going 
home. She was advised to use her voice fully and not to be 
afraid of using it, although loud abusive talking would 
obviously be ill-advised. One week later the patient wrote: 


It is still difficult for me to believe this “miracle” has 
really happened. Each day is a gift from God. The most 
wonderful change is that | can again enjoy being with 
people | love rather than wishing they would go away 
and let me have some peace. 

lam still a little hesitant to talk too much; | don’t want 
to “wear it out.” 

| can barely remember the constant fatigue and 
depression | felt only a week ago. The future looks 
bright and sunny. | believe faith and a realistic expecta- 
tion of success will keep this “miracle” working. 

There are no words adequate to express our grati- 
tude. You have touched our lives in a very special way. 


One year after surgery, her voice could be described as 
nearly normal. It is smooth, melodious, free of any hint of 
spasm, and, most importantly, has remained consistently 
so without any sign of deterioration." 


"In the ensuing years, much has changed in the medical world. Hospi- 
talizations are shorter and additional treatment options such as 
botulinum toxin injections and thyroplasties are available. Recurrent 
laryngeal nerve procedures remain a viable option, often combined 
with laser thinning of one of the vocal folds, and as in this case is suc- 
cessful with some patients. 


Q21: What accounts for the voice improvement as time 
passes after surgery? 


Q22: Can all patients who have recurrent laryngeal nerve 
surgery expect the degree of improvement experi- 
enced by this patient? What would you say to a patient 
who asked about the probabilities of improvement as 
a result of surgery? 


Discussion 


This patient had the neurologic adductor spastic dyspho- 
nia of the dystonic variety with associated tremor. The 
strained-hoarse arrests of the voice arose from intermit- 
tent hyperadduction of the true and false vocal folds. The 
first clue to the neurologic nature of her spastic dysphonia 
was the tremor in her voice and head and the complaint of 
turning of the head. Tremor and spasmodic torticollis 
commonly occur in association with each other, and 
adductor laryngeal spasms can often be a sign of either 
disorder. The patient’s complaint of difficulty grasping a 
writing instrument may have indicated some dystonic 
involvement of her hand. After ruling out primary psy- 
chogenic factors and obtaining a neurologic examination, 
and knowing that previous efforts toward voice therapy 
were unsuccessful and that the patient was approaching 
the desperate stage of her experience with the disorder, it 
was obvious that something more drastic needed to be 
done. Informing the patient as much as possible was nec- 
essary. The advantages and disadvantages of surgery 
needed to be presented, preferably with actual case 
demonstrations to show the patient what might be 
expected from the surgery. 

The purpose of the lidocaine injection is temporarily to 
paralyze one vocal fold, simulating the surgery. The kind 
of voice that results from the injection is not exactly like 
that from the surgery owing to the frequent involvement 
of the superior laryngeal nerve affecting the cricothyroid 
muscle. The patient has to be informed of this fact, and 
the rule of thumb is that if the patient can accept the lido- 
caine voice, acceptance of the postsurgical voice will cer- 
tainly be as good or better. The injection is made in the 
cricothyroid space because that is the entrance site of the 
nerve into the larynx. The injection lasts anywhere from 
10 to 45 minutes, depending upon how much of the anes- 
thetic infiltrates the nerve. The laryngologist can tell if the 
injection has paralyzed the vocal fold by performing a 
laryngoscopic examination after injecting the anesthetic. 
The speech pathologist can hear the loss of sharpness of 
the cough, and the voice changes—sometimes rather 
suddenly—to a husky smoothness, and strained voice 
arrests suddenly disappear. The voice should be recorded 
during this period for a further evaluation using the same 
test materials as those used prior to evaluation, usually a 
standard paragraph, vowel prolongation, and production 
of coughing and the glottal coup. 

The immediately postsurgical voice is usually breathier 
and weaker than after the passage of several days owing to 
the fact that the glottis is widest at this time and that the 
intact vocal fold has not yet learned to compensate. The 
patient needs to be reassured that this immediately post- 
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surgical voice will not be permanent and that improve- 
ment can be expected within a matter of days. Because 
there is some evidence to indicate that maintenance of a 
higher, smoother voice may help to avert spasm in the 
future, the patient is advised to phonate in a slightly higher 
pitch than usual and that this be felt in the “mask” region 
of the face. Feeling in the mask region indicates good res- 
onance balance. Finally, it is always wise to maintain con- 
tact with the patient and request follow-up recordings or 
return visits to monitor the progress of the voice, to pro- 
vide counsel and voice therapy, as necessary. 


Strained Hoarse Voice Arrest-Intermittent 
Rhythmic Group (SHVA-IR) 


Case Study 10.8 


A 47-year-old married woman, referred by her family prac- 
titioner, complains of a voice disorder that she has had 
since age 25 years. Until 5 years ago, it was intermittent, 
but since then it has been persistent. She describes her 
voice as tremulous. One doctor years ago said she proba- 
bly had parkinsonism. 

The internist’s note states that her voice symptoms 
occurred at first only when she was excited or under 
stress, but over the years they have become persistent. 
Tension or stress aggravate her voice disorder. Although 
not disabling, it is embarrassing. 

The patient has had hypertension for 4 years, which is 
controlled by medication. Her dentition is in poor repair 
with numerous caries, and she has gingivitis. There is no 
history of diabetes, hypothyroidism, or any other meta- 
bolic disease. Her cardiovascular status is normal. She 
complains of heartburn once or twice a week. She had a 
hysterectomy 10 years ago when she was 5 months preg- 
nant. She has had hot flashes for 2 to 3 years, which are 
not troublesome. She does not take estrogens. She has a 
mild suprapatellar aching arthralgia when she sits for long 
periods of time. 

The patient divorced her first husband 15 years ago 
because of physical abuse. Her present marriage is satis- 
factory. She is employed full-time as an office administra- 
tor. She and her husband have four children. 

Laryngologic examination fails to disclose laryngeal dis- 
ease, and the laryngologist thinks her tremulous voice 
may be “functional” (Fig. 10.14). During speech examina- 
tion, you find a pleasant, mildly obese woman who says 
her voice disorder was intermittent when it started, but 
she has become concerned about her voice recently 
because it has grown worse over the past 3 years. It is now 
a definite communication problem and interferes with 
her work as an administrator. 

During conversational speech, she has staccato voice 
arrests. When she sustains the vowel /a/, you hear rhyth- 
mic voice arrests of the type heard during contextual 
speech. You record the vowel prolongation on audiotape 
and obtain an oscillographic tracing (Fig. 10.15). Speech 
intelligibility is poor owing to the voice arrests. You also 
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Adductor Figure 10.14 Decision making schema 
y g Spastic for possible causes of Strained hoarse 
Organic (Essential) Tremor — Dysphonia voice arrest-intermittent rhythmic group 
Intermittent (Sevete) (Neurologic) (SHVA-IR). 
Rhythmic 
SHVAAIR Complete © — Neurologic 
Vocal Fold Palatopharyngolarygeal 
Hyperadduction Myoclonus 
(Severe) 


notice an almost imperceptible head tremor and possibly 
tremors around her lips and chin as she speaks, but which 
are absent when she is silent. Except for her dysphonia, 
motor speech examination is normal. The patient 
straightforwardly denies any personal problems during 
the onset or development of her voice disorder. 


Q1: From the oscillographic tracing, what is the approxi- 
mate frequency of the patient’s voice arrests? 


Q2: What is the significance of the observation that the 
patient’s voice, head, lips, and chin are tremorous dur- 
ing speech but not during silence? 


Q3: What could be the connection between the voice 
arrests and tremor in other parts of her body? 


You schedule a neurologic examination for the patient. 
This examination confirms the history of the voice disor- 
der and adds other important information: that the 
patient’s legs are beginning to shake and she feels as if her 
body is getting quivery also. Once or twice a week she 
chokes while eating, and she thinks she has some quiver- 
ing of her breathing as well. She has previously been told 
that her symptoms are “all in her head.” The neurologic 
examination discloses a rapid, forward-backward head 
tremor, voice tremor, and respiratory tremor. Her hands 
and feet are quite steady. She has a small-fiber peripheral 
neuropathy in her legs and mildly decreased reflexes. Her 
blood pressure is elevated. 

The neurologist is the first person to ask the patient 
about any family history of voice or associated tremors; 
previous examiners had failed to do so. From this single 
question emerges the most important information of the 
entire series of examinations—that tremors in different 
parts of the body are rampant in the patient’s family. The 
department of genetics produces the pedigree in 
Fig. 10.16. They reported the following data: 





+ The patient’s maternal grandmother had tremor of her 
body and voice. 


Her maternal-maternal great grandfather had tremors 
of his voice, arms, legs, and head. 


+ Her maternal-maternal great aunt had no outward 
tremor but is said to have “shook inside.” 


+ The mother has tremor of her voice and hands, cannot 
use her hands, and has to be dressed and fed. 


@ One maternal aunt has tremor of her voice and arms 
and cannot use her hands. 


+ One maternal aunt has tremor of her body, voice, and 
hands but is able to use her hands. 


+ Two maternal first cousins are said to have voice and 
general tremor. 


+ Her sister has tremor of her head. 

+ Her brother has tremor of his hands. 

+ One son has a “halting” speech pattern. 

Q4: What is the etiologic diagnosis of this patient’s voice 
disorder? 

Q5: With what other voice disorders is this one often confused? 


Q6: How does the physiology of the voice disorder explain 
the patient’s voice tremor and voice arrests? 


Q7: What relevance is the history that the voice worsens 
under stress? 


Q8: What therapy would you suggest for the voice disorder? 


Q9: How would you counsel a person with this type of 
voice disorder as to its prognosis? 


Discussion 


This patient’s voice arrests are momentary laryn- 
gospasms produced by intermittent hyperadduction of 


Figure 10.15 Oscillographic tracing 
of audio recording showing voice 
arrests produced during vowel /a/ pro- 
longation. Each time the tracing 
returns to baseline represents no voice, 
or voice break. 1 second equals 25 mm 
revealing that arrests occur approxi- 
mately 4 times/second. 
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© Voice Tremor 
Vi. 1 © General Tremor 


the vocal folds. Their regularity or rhythmicity is more 
apparent during vowel prolongation than during contex- 
tual speech. Figure 10.15 shows that these arrests occur 
at a rate of approximately 4 Hz. Rhythmic voice arrests 
and tremor elsewhere in her body (but only on initiation 
of movement, not at rest) mean that she has an intention 
tremor involving her larynx, head, and other body parts. 
In this particular patient, tremor is present in other mem- 
bers of the patient’s family. She qualifies for the diagnosis 
of essential (heredofamilial) tremor. 

Her voice disorder, which some clinicians might have 
labeled adductor spasmodic dysphonia without qualifica- 
tions, needs to be further specified as the adductor spas- 
modic dysphonia of essential tremor, which is to be distin- 
guished from other forms of the disorder. The voice 
disorder is clearly neurologic and highly resistant to voice 
or drug therapy though many pharmaceutical agents 
have been tried with varying degrees of success. At pres- 
ent, the best that can be done for the patient is to educate 
her as to the etiology of the disorder—many patients are 
confused about this and are at least relieved to know that 
it is a recognized neurologic entity and that, although its 
prognosis is one of continued dysphonia, the disorder is 
not life-threatening. Education also helps the patient 
become a better consumer and able to be watchful for 
development of new and effective treatments. When the 
adductor spasmodic dysphonia of essential tremor 
becomes severe enough to threaten speech intelligibility, 
and not all do, consideration of recurrent laryngeal nerve 
resection should be given with the understanding that its 
effects may be temporary. However, experience and clini- 
cal studies establish that Botox injection can be highly 
therapeutic for this disorder, but some patients opt for 
the nerve resection rather than undergoing Botox injec- 
tions every 3 or 4months. Use of amplifiers and/or speak- 
ing softly may prolong the effects of botulinum toxin 
injections. 
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Figure 10.16 Family tree of patient in Case Study 
10.8 Strained hoarse voice arrest-itermittent rhyth- 
mic group (SHVA-IR) determined to be adductor 
spasmodic dysphonia of essential (heredofamilial) 
tremor. The arrow indicates the patient described in 
the text. 


Voice Tremor Group (VT) 


Case Study 10.9 


This is a 68-year-old, retired female telephone operator 
referred by a laryngologist who writes that the patient has 
a tremor of her voice, that it is “functional,” and that voice 
therapy is recommended. The patient is a heavy smoker 
and has bilateral polypoid changes on both vocal folds 
(Fig. 10.17). 

The patient enters your office, and you notice her right 
hand shakes as she extends her hand in greeting. For a 
moment, as you look at her sitting in silence, her head 
seems to be nodding rhythmically, but her hands rest qui- 
etly in her lap. Upon responding to your question as to 
when her voice trouble began, you hear a coarse, low- 
pitched, quavering voice. The patient says she has been to 
several doctors about her voice, which is embarrassing to 
her. She is apologetic about not heeding her local physi- 
cian’s advice to stop smoking. Recently she was told that 
her voice was a sign of mild Parkinson’s disease. Her voice 
tremor had begun 8 years ago, so gradually she was hardly 
aware of it. However, her head tremor began 30 years ago. 
Her brother, who is 3 years younger, has had head and 
hand tremor for 4 or 5 years, but his voice is normal. 

The voice tremor came on after the extraction of three 
teeth for pyorrhea. She was given thiopental sodium anes- 
thesia. All went uneventfully. Two weeks after surgery, her 
teeth were fitted and she had some trouble with her articu- 
lation. Six weeks later, a quaver in her voice first appeared. 
Since then, both the voice and head tremor have gradually 
worsened. Her tremor is worse when she is tired or upset. 


Q1: What is meant by voice tremor? 


Q2: What would you expect to see on laryngoscopic exam- 
ination during vowel prolongation in a patient with 
voice tremor? 
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Organic (Essential) Tremor Figure 10.17 Flow schema of diagnostic decision making for 
(Mild-Moderate) cases of Vocal tremor group (VT) as illustrated by Case Study 
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Q3: From the data given, what arguments can you muster 
in support of the diagnosis of parkinsonism as a cause 
of the voice tremor? 


Q4: From the data given, what arguments can be made 
against parkinsonism as a cause of her dysphonia? 


Q5: What evidence do we have that the voice tremor is 
“functional” or psychogenic? 

Q6: Why does the patient have a low-pitched hoarseness 
in addition to the tremor? 


An oral-physical and motor speech examination 
revealed the following information: 


+ The patient’s voice tremor on vowel prolongation aver- 
aged 5 Hz. The tremor smoothly varied, but occasion- 
ally she would produce a complete voice arrest. 


@ Synchronous with the voice tremor were pharyngeal 
and tongue tremor. Almost unnoticeable were also lip 
and mandibular tremors. 


+ Although no tremor was noticed when the upper 
extremities were at rest, the patient had fine hand 
tremor when she extended her arms with fingers 
spread. 


+ The patient casually mentioned during this examina- 
tion that she has used an alcoholic beverage to reduce 
the severity of her tremors. 


Q7: What formal etiologic diagnosis would you postulate 
for this voice disorder? 

Q8: How would you proceed from this point on? 
Suspecting a neurologic etiology, you arrange for the 

patient to have a neurologic examination, which showed 

the following: 

+ terminal tremor of both upper extremities 

+ tremor of her head and mandible 


+ mild reduction in sensitivity to touch over the dorsum 
of both feet 


+ mild incoordination on finger-to-nose testing, bilater- 
ally 
+ mild incoordination on heel-to-knee testing 


The neurologist’s conclusion from the history and phys- 
ical examination: “This seems to be a nonspecific tremor— 
probably of basal nuclear origin and possibly familial. It 
falls within the scope of organic (essential) tremor or 


Cerebellar Disease 


10.9. 


heredofamilial tremor. | doubt that anything is likely to be 
helpful.” 


Discussion 


Outside of neurologic circles, tremor of the voice is often 
misdiagnosed as either “functional” or psychogenic or as 
associated with parkinsonism. On laryngoscopic examina- 
tion, during vowel prolongation the vocal folds are easily 
visualized rhythmically adducting and abducting, the pitch 
rising during adduction and falling during abduction. 
When severe, the vocal folds will tightly adduct stopping 
vocal fold self-oscillation, producing voice arrests. If these 
are pervasive, the voice disorder takes on the characteris- 
tics of adductor spasmodic dysphonia. 

Parkinsonism does not usually produce voice tremor. 
In some parkinsonian patients, one can hear a highly sub- 
tle tremor like “flutter” on vowel prolongation, but it 
does not possess the coarser type of tremor as in this 
patient who has organic (essential) tremor, probably 
familial. Patients with parkinsonism have tremor, but it is 
a rest tremor. Essential tremor is an action or intention 
tremor. 

The patient under discussion was quiescent at rest and 
exhibited her tremor only during the volitional acts of 
phonation and extension of her extremities. The student 
might ask why, then, did she have head tremor at “rest”? 
The answer is that holding up the head is not actually a 
resting activity; doing so requires active muscle contrac- 
tion to overcome the effects of gravity. 

Support for psychogenic etiology was unsubstantiated, 
although stressful events can aggravate and even precipi- 
tate tremor, as they do in most neurologic diseases. 

The low-pitched hoarseness was consistent with the 
polypoid changes of the vocal folds secondary to the long 
history of tobacco abuse and had no relation to the voice 
tremor. 

Alcohol consumption frequently (not always) reduces 
the amplitude of organic voice tremor and is useful infor- 
mation in diagnostic history-taking. 

Management of organic voice tremor is not easy. 
Symptomatic voice therapy may reduce some of the coun- 
terproductive compensatory muscle patterns but is inef- 
fective in eliminating the tremor. No convincing evidence 
of the effectiveness of essential tremor medications for 
voice tremor has been reported, even though such med- 
ications are often beneficial for tremors elsewhere in the 
body. Although many patients spontaneously employ 
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alcohol as a palliative for voice tremor, it is hazardous 
because of its potential for producing dependency. 
Educating the patient about the etiology of the voice 
tremor is more valuable than clinicians may realize. As 
with many other disorders about which patients have 
been given an incorrect diagnosis, or none at all, to learn 
that essential tremor is organic rather than psychogenic, 
that it is benign, that it does not necessarily have to 
worsen, and may even improve, and that the dysphonia is 
common and is acceptable to many listeners can be a 
genuine relief to patients seeking help for this disorder. 


Breathy Whispered Intermittent Group (BW-I) 


Case Study 10.10 


A 39-year-old clergyman is referred to you by a speech- 
language pathologist colleague. The patient has had a 
complete laryngologic examination that failed to disclose 
any organic laryngeal disease. The referring speech clini- 
cian was not experienced in voice disorders and had 
administered therapy without success. He requests 
another opinion on diagnosis and treatment of the dys- 
phonia because he is unsure of the diagnosis of abductor 
or adductor spasmodic dysphonia (Fig. 10.18). 

The patient, a tall, blond man of striking appearance, 
has penetrating eye contact and exudes an air of solidity 
and authority. He is pleasant and straightforward. His lan- 
guage is explicit. He gives the impression of someone 
who genuinely wants help and will cooperate in any way 
he can to get it. After completing the preliminaries, he 
interrupts the examination by asking if his wife could join 
in the examination, explaining that she has been worried 
about his voice. 

During conversational speech and oral reading of a 
standard paragraph, you note the following: 


@ Every few seconds, his normal voice cuts out and 
becomes breathy or whispered. Then, just as suddenly 
it returns to normal. 


—Neurologic — Organic (Essential) Tremor —— 
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These moments of breathiness most often follow 
unvoiced consonants. 


They are more often triggered by plosives than by frica- 
tives. 


+ These plosives are almost invariably unvoiced. 


è The unvoiced plosives trigger breathiness nearly 
always when they are in the initial rather than the 
medial or final positions of words. 


The patient can produce several sentences without any 
segments of breathiness. 


During vowel prolongation, his voice is continuously 
normal (i.e., free of breathy segments). 


Each time he produces a moment of breathiness, you 
notice an abrupt inward movement of his abdomen. 


A complete oral-physical and motor-speech examina- 
tion fails to disclose any evidence of structural or motor 
disturbances in the remainder of his speech mechanism. 
The ear, nose, and throat examination is normal. His 
cough is normally sharp. 

You take a careful history. “When and how did your 
voice disorder begin?” He tells you that it came on 5 years 
ago very gradually, increasing to its present level of sever- 
ity within 6 months. It has remained this way ever since. 
He notices that sometimes his voice is better when he is 
relaxed and worse when he is under stress, such as while 
delivering a sermon. He tells you that his voice has never 
returned to normal for more than a few minutes during 
the entire history of his voice disorder. 

One year after his dysphonia had begun, he sought the 
help of a speech pathologist who, although not a special- 
ist in voice disorders, agreed to a trial of therapy, because 
there was no one else available in the patient’s locality. For 
3 months, twice a week, the patient was given therapies 
consisting of grunting, chewing, and relaxation exercises 
associated with biofeedback sessions during which elec- 
trodes were placed over his forehead and jaw muscles. 
These ministrations were mildly and temporarily helpful, 
but they benefited his voice least when he needed it most, 
namely, during the counseling and public-speaking 


Adductor 
Spastic 
Dysphonia 
(Neurologic) 


Abductor 
Spastic 
Dysphonia 
(Psychogenic) 


Figure 10.18 Decision making schema 
for Breathy whispered intermittent Group 
(BW-1) illustrated by Case Study 10.10 
demonstrating abductor spasmodic dys- 
phonia. 
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demands of his profession. After what he felt was a fair 
trial of therapy without acceptable results, he broke off 
further appointments and did nothing about his voice for 
3 years. 

“Did the speech clinician ever take a history of any psy- 
chologic or emotional problems that you might have had 
at the time your voice disorder started?” The patient 
answers that he had been asked by the speech-language 
pathologist some general questions but did not go into 
any depth; he seemed to emphasize voice retraining over 
other considerations. 

You begin to probe the subject of emotional stress. The 
patient readily accedes to this line of questioning. You ask 
him about his relationship with his wife and his profes- 
sional associates and are comfortable with his responses 
that indicate (1) he has a stable, amicable relationship 
with his wife; (2) he is effective and well-respected as a 
pastor of a small, upper-middle-class suburban commu- 
nity. 

“Did anything upsetting happen at the time the voice 
disorder started 5 years ago?” The following information 
emerges. 

The voice trouble began during a period when their old- 
est daughter had been arrested for shoplifting. Father and 
daughter began to have angry “blowups” during which 
the father was frequently driven to the verge of striking 
her. The parents tried desperately to help but they had 
never sought professional assistance. 

Commenting on his voice during the height of his con- 
flicts with his daughter, he says: “The more | would 
yell, the worse my voice would become.” During the 
patient’s narrative of this difficult period, his moments of 
breathy voice are more frequent and of longer duration. 
Toward the end of his narrative, as he brings the history 
up to the present, he has fewer and briefer moments of 
breathiness than at the beginning of the interview. 

During trial-and-error therapy, you discover that if the 
patient makes an effort to sustain voicing during 
unvoiced consonants, he has fewer and shorter breathy 
and aphonic breaks. 


Q1: Describe the movements of the vocal folds responsible 
for the voice aberrations during the patient’s contex- 
tual speech. 


Q2: Whatis the most likely (physiologic phonetic) explana- 
tion for the fact that the breathy and aphonic periods 
are more severe after unvoiced plosive consonants? 


Q3: Why are the breathy moments less severe or absent in 
association with unvoiced consonants in medial and 
final rather than in initial positions of words? 


Q4: How do you account for the patient’s ability to sustain 
vowel sounds without breathy or aphonic breaks? 


Q5: What is the most plausible explanation for the abrupt 
inward abdominal movements simultaneous with 
moments of breathy voice? 


Q6: What symptomatic diagnosis would best describe this 
voice disorder? 


Q7: What do you think is the etiology of this patient’s voice 
disorder? Support your interpretation. 





Q8: How would you proceed in the treatment of this 
patient’s voice disorder? 


Discussion 


Patients who have alternately normal voice and breathy or 
aphonic segments during contextual speech may be 
described as having abductor spasmodic dysphonia. This 
is a descriptive term only; it tells us nothing about its 
cause, which requires further investigation. A fiberscopic 
examination of the vocal folds during contextual speech 
discloses that each time we hear an aphonic break, we see 
the vocal folds abduct, and each time we hear a voiced 
segment, we see them adduct. The sudden inward move- 
ment of the patient’s abdomen with each aphonic seg- 
ment occurs because of the sudden drop in vocal fold 
resistance to the air stream being forced from the lungs 
by abdominal contractions. 

In abductor spasmodic dysphonia, the loss of muscle 
tension seems to be worsened by unvoiced, as opposed to 
voiced, consonants, especially voiceless plosives. Appar- 
ently, there is some pressure-timing factor that with these 
phonemes inhibit timely contraction and closure after 
vowels, and as a result they overshoot, releasing more 
than the usual amount of air. Their propensity to do so is 
greatest when the plosives are in the initial positions of 
words rather than in medial positions, where they are 
bounded by voiced phonemes, or the final position of 
words where they are preceded by voiced phonemes. Pre- 
sumably, that is why vowel prolongations usually contain 
few or no aphonic breaks; the larynx works less as an artic- 
ulator and does not require as much in fine motor control 
adjustments required for connected speech, and vocal 
folds are adducted and held in a steady state during the 
entire period of phonation and are not required to shift 
from voicing to unvoicing and back again. 

The psychosocial history strongly points to the proba- 
bility that this patient’s dysphonia is psychogenic, having 
begun during a period of intense anger and public humili- 
ation brought on by the daughter’s antisocial behavior. 
What is curious is that the dysphonia persists despite res- 
olution of the daughter’s difficulties, although the parents 
still harbor some concern about her. As of this writing, 
symptomatic voice therapy has not been successful. 
Although this suggests deeper psychiatric investigation is 
probably warranted, in cases like this voice therapy should 
always be tried, and if it fails Botox would be another 
option. Teaching the patient to voice all consonants may 
help the patient override the breathy or voiceless seg- 
ments. The etiology of this patient’s voice disorder has 
not been firmly established and, for the present, must be 
considered idiopathic. 

As a final note, we should be aware that, like adductor 
spasmodic dysphonia, the abductor type can be of neuro- 
logic as well as psychogenic etiology. Two signs of the 
neurologic type are an absence of a positive psychiatric 
history and, on vowel prolongation, instead of steady 
voice, there are rhythmic unvoiced interruptions that may 
indicate abductor spasmodic dysphonia of essential voice 
tremor. 
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Hypothyroidism Clinician (C): In terms of loudness? 
P: In terms, | think, of volume. 
Endocrine Virilization C: Volume. That’s what | meant. How about pitch? Do 
you think you strive for a low-pitched voice, consciously or 
Acromegaly unconsciously? 
P: | don’t know. | may do that. | don’t know. | may push 
toward a lower voice range or frequency. | don’t know why 
Contact Ulcer i; 5 y 3 
7 i | might do that. Generally, I’m not conscious of it. |I may be 
_— doing it and not be conscious of it. 
Reduced Hyperplasia 


C: Can you think of any ideas as to why you might be 


Mc saith — pushing for a lower frequency or a lower pitch? 
Frequency P: Oh, | don’t know. | suspect it might be more com- 


fortable in a higher range. 
C: Any other thoughts come to mind? 
P: | don’t know whether | feel this is more impressive, if 
I’m doing this because | think it’s more impressive. 
Chronic Stress C: There’s a possibility that that may be true? 
P: | can’t isolate that. 
L C: It’s a possibility, but not necessarily true. 
P: | know that when I can start speaking in Norwegian | 
can allow a kind of higher range that’s easier. 
C: It’s easier to speak at a higher pitch? 
Figure 10.19 Flow chart demonstrating schema of decision making P: Well, it is when I go into a different language. 
in cases of Low-pitched hoarse group (L-PH) as described for Case Study C: How about in English; think you should go into a 
10.11. higher pitch? 





Psychogenic 


Elective Low Pitch 


Low-Pitched Hoarse Group (L-PH) 


Case Study 10.11 


The following case is that of a 41-year-old professor of 
English literature with a 5-year history of episodic hoarse- 
ness related to lecturing.? The hoarseness that brought 
him to the clinic began 7 months prior to evaluation and 
coincided with the beginning of a regular teaching sched- 
ule and new duties as a curriculum supervisor. Indirect 
laryngoscopy showed a contact ulcer granuloma on the 
posterior third of the left vocal cord (Fig. 10.19). 

Patient (P): Why, | suspect I’ve had this kind of voice irri- 
tation for several years. The last time | had an examination 
of the larynx was, I think, about 5 years ago, and there was 
some irritation there but no growth. It has just been this 
fall that |, as | said, had the laryngitis, at least that’s what | 
call it. | was concerned about it and sinus headaches and 
so forth. But it was the hoarseness and the irritation and 
the voice loss that concerned me most, and | work hard at 
teaching. | move around a lot, | push myself. My style is a 
kind of Socratic, excited, let’s-move-ideas-around kind of 
style, and | work hard at it, and | force, | know | push my 
voice. I’m pushing it. | don’t sit quietly and let my voice 
assert itself. When | lecture, particularly in a larger group, 
and I’ve had classes of 35 and 40 and 50, | tend to really 
move out, and | do know that | push my voice beyond its, 
certainly its normal speech level. 


P: | don’t know, but it seems to be a little easier and | 
seem to be pushing it. | seem to be speaking farther for- 
ward or getting the vibration farther forward in my head 
or something. It does hurt more, like when | drop down 
now, as opposed to speaking maybe up like this. 

C: Do you lecture in a large room? 

P: Yes, | have. 

C: Are the acoustics good or bad? 

P: They vary tremendously; generally speaking, they’re 
not very good. 

C: Why is that? What is the nature of the acoustics? 

P: Just flat. There is a sense that you have to really get up 
and project. Most of my colleagues have mentioned this, 
too. | don’t think students could really hear you in the back, 
you know, if | sat and talked as I’m talking now, for exam- 
ple. Even if it were very quiet, some of the students in the 
back row simply couldn’t hear me in those rooms. | have to 
push more than I’m pushing right now with my voice. | 
move toward them. I’ve done everything | can spatially to 
get close to them, at least in physical proximity. | do think 
that right now | can feel a little irritation and hoarseness. 

C: How about noise levels, background noise levels, as 
you're trying to lecture? 

P: It’s very quiet. I’ve never had a problem with that. | 
suspect it might be in secondary education, but certainly 
there’s none. | think | am nervous. I’m always overpre- 
pared. I’m always very anxious about doing well. | always 
have, so that | feel very satisfied, and I’ve always had good 
responses, but I’ve had to work hard at it, and | sweat a lot. 
| know I’m working hard. I’m in there, | get ready for the 
class, | prepare very carefully, | plan courses very carefully. 
| work with them very carefully, so | suspect that | am nerv- 


?An actual audio recording of the transcription of the following inter- 
view can be heard in Aronson, A.E. (1973). Audio seminars in speech 
pathology, psychogenic voice disorders. Philadelphia, PA: W.B. Saunders. 
Cassette no. 1, side 1. 


ous about class. 
C: Now, how will your voice fare during the course of, 
say a 50-minute lecture, when you are in this particular 


228 Clinical Voice Disorders 


mood and you have this degree of tension or anxiety? Will 
it last or not? 

P: It will last, but | feel the strain will last 10 minutes or 
so. In fact, there will be a more pronounced hoarseness. | 
can feel the tightening. On the other hand, well | suspect 
that no matter whether I’m using the Socratic method or 
not, or just moving into a kind of casual, easy going kind of 
dialogue of some kind, that the differences are a differ- 
ence of degree, and in those instances | feel the strain and 
tightening. 

C: Will that tightening also be accompanied by, say, a 
dull aching in your larynx? 

P: You’re asking me to become conscious of things | 
haven’t really paid much attention to, and I’m always a lit- 
tle suspicious of my own recall and my own awareness of 
my physiologic processes, but | think that’s true. And | 
think there is even a tendency to bring about some sort of 
headache, a little bit of a headache from it. That is, | can 
feel irritation here starting to bring about a headache 
reaction . . . But it’s never been easy for me to talk. | was 
always very quiet when | was younger, and through my 
whole career as a teacher I’ve worked hard to project, to 
get away from this. I was fairly introverted and quiet, and 
I’ve always had the feeling that other people have had 
stronger voices, that | have not had a particularly strong 
voice. But, as | said, | can always remember this kind of, 
this tendency toward this, this hoarseness, and when | 
talked for a long time or lectured, | taught high school for 
4 years, and | had 5 hours of class a day, | would be darn 
sore at the end of the day. | would tend to become very 
quiet and my wife would say that | would, for the rest of 
the day, speak very, very quietly if | could. 

C: Would you feel sort of worn out in general from talk- 
ing, sort of fatigued? 

P: From that more than anything else. Just the physical 
business of talking. Not the classroom dynamics or any- 
thing like that. It was just talking. It has never been partic- 
ularly easy for me. And I’m in a profession where that’s the 
medium, you see, that’s the demand. 


Q1: What is the physical explanation behind the exces- 
sively low pitch of this patient’s voice? 


Q2: What is the reason for the contact ulcer being located 
at the posterior third of the vocal fold? 


Q3: What is the contribution of the patient’s artificially 
lowering his pitch to his voice disorder, and what 
would you predict the stroboscopic image would look 
like? 


Q4: What might be the causes of the patient’s complaint 
of tightness and dull aching in his throat? 


Q5: Comment on the relative contributions of vocal abuse, 
personality, and emotional stress to the development 
of this patient’s voice disorder. 


Discussion 


This patient has a poor acoustic working environment 
coupled with poor vocal technique of long-standing dura- 





tion. It is likely that he uses excessive muscle tension to 
artificially lower and to project his voice resulting in some 
discomfort. The tightness and dull aching in the throat 
may emanate from a combination of muscle tension, con- 
tact ulcers, and tissue irritation from reflux. Some clinicians 
would choose to directly observe the patient teaching or 
ask him to bring in a videotape of his teaching in a large 
classroom to determine how his teaching voice differs 
from his conversational speech. As in most other disor- 
ders, it is also likely that emotional stress increases his 
vocal difficulties and lowering his pitch may increase his 
adduction quotient, the amount of time in which glottal 
cycle vocal folds are closed relative to the time opened, 
increasing the probability of tissue damage. You would 
predict his videostroboscopic examination would show 
anterior-posterior and medial-lateral constrictions of the 
supraglottis and a long closed phase similar to that seen 
during glottal fry. Both amplitude and mucosal wave 
would be severely reduced but the tissue of the membra- 
nous portion would appear during inspiratory phonation 
to be pliable. Contact ulcers generally occur on the poste- 
rior third of the glottal margin at the vocal process where 
the vocal fold tissue is its thinnest.? Approximately 80% of 
males close this portion of the glottis whereas most 
females have a posterior glottic chink. This suggests one 
of the reasons contact ulcers are seen in males but rarely 
occur in females who have not experienced intubation 
trauma. Contact ulcers are difficult to treat but generally 
benefit from voice therapy combined with a strong antire- 
flux regimen. Surgical removal without concomitant 
behavioral and medical management often results in 
recurrence. Behavioral modification is generally directed 
at improving vocal fold vibratory patterns by finding 
appropriate pitch, reducing posterior glottic closure and 
hard glottal attacks, improving vocal projection, and, 
when necessary, using amplification. 


High-Pitched Group (HP) 


Case Study 10.12 


A 23-year-old college student comes to the clinic com- 
plaining of general fatigue. She has a history of mononu- 
cleosis. Six months ago, she noticed a drooping of her left 
eyelid that was diagnosed elsewhere as a sign of myasthe- 
nia gravis. However, a Tensilon test was negative. During 
the neurologic history, she does not complain of difficulty 
chewing or swallowing, double vision, or limb weakness; 
in fact, she says she runs 3 miles a day. Her neurologic 
examination is normal; however, her history of left ptosis 
leads the neurologist to conclude that she does have ocu- 
lar myasthenia that is currently asymptomatic. After dis- 
cussing the pros and cons of continuing her medication, 
she is referred for evaluation of an accompanying speech 
disorder. 


3Contact ulcers havc been attributed to esophageal reflux, intubation, 
and vocal trauma. 
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[~ Endocrine — Anatomically Small Larynx because “My voice is me. | don’t have a problem; my prob- 
lem is that other people have a problem with my voice.” 
Q5: What do you think is the diagnosis of this person’s 
speech disorder? 

Q6: How would you proceed in handling this case? 

Increased . g 

HP Vocal Fold Mass — Web Discussion 
Fundemental 











This is an instance of the relatively rare psychogenic disor- 
der known as childlike speech in adults. Technically, it is 
not solely a voice disorder, as articulation is integral to the 
total syndrome. It bears many of the features of conver- 
sion disorder in that the patient’s abnormal use of the 
speech musculature is unconsciously determined, that 
the childlike speech “selected” serves to communicate 
something very important about the self, and that the 
disorder probably affords the patient primary or secondary 
gain in her daily life. The prior diagnosis of an abnormally 
small larynx was presumptive; the speech examination 
proved she was capable of producing a lower pitch appro- 
priate to her age and sex. Nothing in her speech remotely 
resembled the flaccid dysarthria of myasthenia gravis 
with its breathy, weak voice, hypernasality, and consonant 


Frequency 


Conversion High Pitch 





Mutational Falsetto 


—Psychogenic (Puberphonia) 


Childlike Speech in Adults 


Figure 10.20 The High-pitched group [HP] schema demonstrates 
the possible causes of increased vocal fold fundamental frequency 
exhibited by the patient described in Case Study 10.12. 


The patient enters your office casually dressed and 
looking younger than her 23 years. From the moment she 
speaks, you are startled by her high-pitched, though not 
falsetto, voice and the way in which she articulates with a 
slight pursing of her lips and a subtle interdental lisp. Her 
prosody is that of exaggerated pitch changes. Her speech 
gives you the uncomfortable feeling that you are talking 
with a child rather than an adult college student. 

She says her speech began to change about 4 years ago. 
Two years ago, an otolaryngologist said she had a high 
voice because her larynx was too small. Laryngologic 
examination here mentions nothing about an abnormally 
small larynx (Fig. 10.20). 


Q1: How do these speech signs relate to myasthenia 
gravis? 


Q2: What further tests would you recommend to determine 
if her speech had anything to do with that disease? 


Q3: What are the cardinal speech signs of myasthenia 
gravis? 


Testing her speech musculature fails to disclose any evi- 
dence of lip, tongue, mandibular, laryngeal, or respiratory 
weakness. The all-important stress test of counting fails 
to produce deterioration of her speech. Her cough is nor- 


imprecision that worsen during effortful speaking. Her 
myasthenia gravis, limited to her ptosis, was completely 
independent of her speech disorder. 


This patient was adamant in her rejection of our offer of 
speech therapy; in fact, her insight was so poor that she 
did not even think her speech was abnormal; recall her 
conclusion was that the problem was other peoples’, not 
hers. Her denial that her speech was abnormal, also com- 
mon in conversion reaction, and her rejection of help for 
the “symptom” require respect, for this patient is telling 
us that she is not ready to relinquish the protection 
afforded by her childlike speech. Her frankness in disclos- 
ing seriously damaging childhood experiences and her 
acknowledgment that she needs, and is getting, counsel- 
ing probably indicate that she may one day be well 
enough to be able to give up her inappropriate speech 
and return to a normal speech pattern that she once had. 
We wished her well and sent her on her way. 


Summary 


mally sharp. You can find no signs of motor-speech disor- + In actual clinical practice, differential diagnosis of a voice 
der. She can lower her pitch level on demand, but she disorder begins with categorizing the abnormal voice and 
does not maintain it during conversation speech. then working backward to its etiology by means of logical 
correlation with laryngologic, psychologic, physical, and 


Q4: What would be the next step in your investigation? page 
neurologic evidence. 


The psychosocial history discloses that between the ages 
of 10 and 17, the patient and both of her parents were alco- 
holic. She gives a history of physical abuse and psychother- 
apy on and off since age 13. She says she is just beginning to + Although the voices within these “clusters” may sound sim- 
get hold of her problems with the help of a school coun- ilar, they can arise from mass lesions, neurologic diseases, 
selor. Asked if she would like to have help with her speech, medical conditions, vocal misuse, or psychologic disorders, 
she replies that, for the time being, she would prefer not to which can only be determined by further clinical studies. 


+ Abnormal voices tend to cluster into perceptual groups, 
but diagnosis cannot be made by voice alone. 
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è Interaction between organic and psychogenic factors ¢ Regardless of the primary etiology, patients may have 
must be borne in mind in the assessment of all voice dis- comorbid factors that contribute, if not to the etiology, to 
orders. the maintenance of the voice problem. 


Patients may have two, or more, coexisting problems that 
are independent. 


Chapter 11 


Treatment of Voice Disorders 


Voice therapy may be defined as an effort to return the voice 
to a level of adequacy that can be realistically achieved and 
that will satisfy the patient’s occupational, emotional, and 
social needs. Not everyone, especially not those who have 
irreversible neurologic or vocal fold lesions, can achieve a 
normal voice. In such cases, the objective is the best possible 
voice within the patient’s anatomic and physiologic capabil- 
ities. It is to the credit of most patients that they understand 
and accept their voice limitations. 

Some voice disorders are so mild that one has difficulty 
deciding if an abnormal voice actually exists—though it 
makes it no less significant to the patient as in cases of pro- 
fessional voice users where a mild voice disturbance can 
cause maximum distress and loss of income. In some 
instances, the voice disturbance may only exist in the singing 
voice, or after a strenuous day of talking, or during periods of 
emotional stress, or may be related less to the voice than to 
increased vocal effort. In other cases, the voice problem may 
be severe but the patient may be unaware of or unconcerned 
about the voice but very concerned about vocal fatigue. 
Who, finally, is to decide if voice therapy is necessary? Ulti- 
mately, it is a team decision, and the patient is a critical 
member of the team. If a reasonable effort has been made to 
increase awareness of voice potential, but therapy is 
declined, such is the patient’s right. Where children are con- 
cerned, the clinician will usually try more aggressively to 
obtain the child’s cooperation, with the consent of the par- 
ents. With experience, clinicians learn that not everyone 
needs to be helped, can be helped, or wishes to be helped. 
However, for those who can be helped and are motivated to 
change, the therapeutic process can result in an improved 
voice and quality of life. 


+ General Principles 


General principles and methods of behavioral voice therapy 
are continuously evolving. The literature advocates different 


techniques for the same disorder, and useful therapies have 
yet to be developed for many disorders. Capable clinicians 
sometimes are at a loss to say exactly what they do to bring 
about voice improvement. Reading the literature on voice 
therapy produces uncertainty as to who is right and which 
alternatives work because there are limited evidence-based 
references available. Underneath all of the apparent alterna- 
tives, however, lie some common principles. 


Remove Obstacles 


Progression from abnormal to normal voice takes place as a 
product of the patient’s conscious, volitional response to the 
clinician’s instruction and encouragement. Yet, at the same 
time the larynx has a natural propensity toward normal 
function in nonorganic disorders. There is a sense one has 
about therapy for nonorganic voice disorders that, rather 
than the need to teach the patient how to use voice again, 
the clinician’s task is to remove obstacles to normal laryn- 
geal function and, by so doing, allow the natural inclination 
for normal voice to break through. Humans do not have to be 
taught how to phonate in the first place. There is no reason to 
believe that our capacity for normal voice does not exist 
throughout life, though clinicians need be mindful that “nor- 
mal voice” differs across the age span. 


Provide Feedback 


Feedback occupies a pivotal position in voice therapy, cut- 
ting across all voice disorders. Individual learners respond in 
different ways to tactile, proprioceptive, visual, and auditory 
feedback. 


Look for Emotional Distress 


Voice disorders resulting from emotional stress ought not be 
treated as only mechanical problems. Consideration of the 
patient's self-concept, daily life problems, and dysphoric 
affect ideally is incorporated into therapy. On the other hand, 
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although treating the cause of a voice disorder may be as 
important as voice production training, the cause may either 
never be found, may be unalterable, or may be beyond the 
scope of practice of the treating speech-language patho- 
logist. 


Offer Individual Therapy 


Voice therapy has to be tailored to the individual. Although 
the fundamental principle of therapy for a given disorder 
does not vary, differences in age, gender, hearing, acquisition 
to feedback, vocal awareness, motivation, intelligence, edu- 
cation, culture, occupation, vocal needs, and general health 
require that techniques be modified to suit these variables. 
Thus, voice therapy is individually tailored to each individ- 
ual. 


¢ Specific Principles 


For many years, the notion has been fostered that voice dis- 
orders follow the same laws of learning as do, for example, 
articulatory or language disorders, and, as such, are treated 
in the same way by applying those laws to the larynx and 
voice. Whereas traditional learning does have application in 
voice therapy (e.g., practice, targeting, trial-and-error, and 
the like), special facts about the larynx and voice that do not 
apply to articulation and language need to be recognized. 


Phonation 


Phonation is built into the organism as an innate neurophys- 
iologic function. It is not “learned” in the same sense as is 
articulation or language. The source of sound is the larynx, 
which also serves sometimes competing functions of airway 
control, swallowing, laughter, and crying. 


Organic Disorders 


In organic disorders caused by vibratory disturbances 
resulting from vocal fold mass lesions or neurologic disor- 
ders of paralysis or movement, the main principles of ther- 
apy are either muscle strengthening, or reducing force of 
closure, or improving resonance balance and vocal tract 
coordination, or adaptation to the mechanical problems by 
means of positive compensatory phonatory and respiratory 
maneuvers. 


Behavioral Voice Disorders 


Behavioral voice disorders, disorders of misuse, abuse, or 
compensation, exist for one or more of the following reasons 
and often exist in combination with organic problems. They 
can occur at the level of the vocal folds or occur at the supra- 
glottic level resulting in resonance imbalance. They may or 
may not have the presence of a lesion. They may or may not 
be related to vocal use, vocal environment, or occupation. 


Hyperfunctional Voice Disorders 


The vocal folds are adducted with excessive force for a variety 
of reasons including general laryngeal muscle tension as a 
reaction to the stress of life (e.g., acute or chronic depression 
or anxiety or excess vocal loading related to occupational or 
recreational needs) or because of personality traits or to com- 
pensate for underlying pathology. These are called hyper- 
functional or musculoskeletal tension voice disorders. They 
are treated primarily according to the principle that when the 
muscular tension is reduced, the larynx’s capacity for normal 
phonation will normally return if the tension is not compen- 
satory. Koufman and Belafsky (2002) state that, “As many as 
90% of patients with abnormal laryngeal biomechanics have 
an underlying organic disorder and are not simply vocal mis- 
use, poor technique or abuse.” Based on a prospective analy- 
sis of 2000 consecutive patients seen at the Center for Voice 
Disorders of Wake Forest University School of Medicine, they 
developed a multifactorial model of voice disorders (the 
B.I.N.N. [behavioral, inflammatory, neuromuscular, neoplas- 
tic] model) and, based on their data, suggest that the average 
patient with a voice disorder has two or more diagnoses. Typ- 
ically these are behavioral (90%), inflammatory (70%), neuro- 
muscular (30%), and neoplastic (20%). This is interpreted to 
mean that most voice disorders have a behavioral compo- 
nent, but it is often not the entire source of the problem. Thus, 
modifying vocal tension may optimize laryngeal biomechan- 
ics or reveal the underlying problem. Voice therapy, then, 
aims at releasing the inherently normal voice suppressed by 
excess muscular tension, an objective that can be accom- 
plished in three complementary ways: 


+ by mechanically (manually) relaxing the musculature 


+ by psychologically releasing the anxiety causing the ten- 
sion 


by creating improved aerodynamic muscular balance 
within the vocal tract 


The vocal folds may fail to adduct (hypoadduct) because 
they are muscularly flaccid or flabby due to nonuse or incom- 
plete use or change in feedback. These are called hypofunc- 
tional voice disorders and when behavioral in origin are 
usually secondary to prolonged use of a weak breathy voice, 
prolonged illness, prolonged voice rest, faulty learning, or age. 
Behaviorally, these disorders are treated primarily according 
to the principle of increasing the tonicity of the vocal folds by 
means of their full employment during phonation. 


Conversion Voice Disorders 


The vocal folds fail to adduct because, unconsciously, the 
patient has a need to be partially or completely without 
voice. These are the conversion voice disorders. The original 
loss or diminution of the voice produces three secondary 
effects: 


Patient Can’t Produce Normal Voice Soon after the apho- 
nia or dysphonia sets in, the patient loses the feel for how 
to produce normal voice, a dissociative-type reaction that 
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persists even after the interpersonal or situational conflict 
causing the voice disorder has been resolved. 


Elevated Musculoskeletal Tension Conversion voice dis- 
orders are not as passive as their whispering and breathiness 
suggest but are actually produced with elevated muscu- 
loskeletal tension. In other words, they contain within them 
a hyperfunctional component involving a variety of combi- 
nations of muscles of the larynx. 

The conversion voice disorders are primarily treated with 
behavioral therapies directed at exposing and, if possible, 
resolving the interpersonal or situational conflicts causing 
them, educating and physically guiding the patient into 
increasing degrees of phonation, and balancing muscu- 
loskeletal tension. 


Voice Abuse Disorders 


Cultural, personality, and emotional factors can lead to 
phonation with excessively sharp glottal attack or long 
closed phase, which traumatizes the vocal fold mucosa, pro- 
ducing general inflammation or discrete lesions. These are 
the vocal abuse disorders. Excess musculoskeletal tension is 
often a component of vocal abuse disorders. They are treated 
according to the principle of reducing habits of hard glottal 
attack, excessive loudness, environmental stress, and per- 
sonality problems. 


Precipitating Factors 


Internal and external environmental factors can lead to tissue 
irritation that combined with excessive vocal loading leads to 
vocal fold trauma. These are called precipitating factors and 
include laryngeal esophageal reflux disease, poor hydration, 
exposure to pollutants, and upper respiratory infections. 
They are treated according to the principle that changing 
lifestyle, modifying environment, or receiving medical treat- 
ment will reduce or eliminate the offending factors. 

Occupational and social factors can lead to excess talk 
time, loudness, or pitch that can lead to vocal abuse. These 
are called aggravating factors and are treated according to 
the principle that increased awareness will reduce or elimi- 
nate the aggravators and improve voice. 


Lesion 


The vocal folds may fail to completely adduct because of the 
presence of a lesion. Some of these lesions resolve with voice 
therapy aimed at reducing precipitating and aggravating fac- 
tors along with direct symptom management. Others require 
surgical management or a combination of surgical and voice 
therapy. 


+ General Voice Therapy 


Voice therapy is performed on two planes: (1) a specific one 
in which techniques are tailored to the requirements of a 


specific voice disorder type, and (2) a general one, composed 
of generally applicable methods. Both planes are interwoven 
in voice therapy, their proportions determined by the 
patient’s requirements and the clinician’s knowledge of 
treatment. 


Evidence-Based Medicine 


No introduction to voice therapy approaches would be com- 
plete without a concomitant discussion of evidence-based 
medicine. As in other areas of medicine, many treatments in 
common use have never been evaluated (Frattali, 1998). 
Increasingly, insurance companies, referral sources, and 
patients want “evidence-based” treatment. Although most 
realize there is never a treatment guarantee, they want to 
know what outcome can be expected for a given treatment 
and how long it will take to achieve it. Frattali lists five pri- 
mary outcomes and example factors that relate to voice 
therapy: (1) clinically derived outcomes (e.g., ability to sus- 
tain phonation or produce a lower pitch); (2) functional out- 
comes (e.g., ability to use the telephone or be heard in a 
noisy environment); (3) financial outcomes (e.g., cost-effec- 
tive care, average number of sessions); (4) social outcomes 
(e.g., employability, ability to interact with others); and 
(5) patient-defined outcomes (e.g., satisfaction with services 
and quality of life). Treatment outcome research has a hier- 
archy ranging from the highest level of experimental, or class 
I, classification to the lowest level of nonexperimental, or 
class Ill, classification. Class I includes randomized con- 
trolled clinical trials and time series research, whereas class 
IHI has no clear comparison groups and includes case studies 
or reports, registries and databases, and group judgments or 
expert opinion. According to Fratalli, “Traditional thinking in 
the medical field reserves randomized controlled clinical tri- 
als as the only accepted method of experimental research—a 
position refuted by clinical researchers who employ small- 
group or single-subject designs.” Verdolini, Ramig, and Jacob- 
son (1998) organized by decade more than 100 voice therapy 
articles. Few of the articles were class I, but they derived six 
conclusions about the benefits and success of treatment: 


+ Functional aphonias may benefit from facilatory, vegeta- 
tive voicing maneuvers. 


+ Some types of intensive therapies benefit voice problems 
related to Parkinson’s disease. 


è Conventional voice therapy may provide a protective 
effect against recurrence. 


+ Conventional voice therapy and resonant voice therapy 
may produce benefits in treatment of nodules. 


+ Intensive hydration treatments may be beneficial in treat- 
ment of nodules. 


+ Hyperfunctional voice disorders may benefit from sys- 
tematic electromyography (EMG) or laryngeal relaxation 
training. 


The conclusions reached from Verdolini and colleagues’ 
scholarly literature review does not mean treatments not 
listed are not effective. Rather, it suggests that at the time of 
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their review, we did not have evidence of the efficacy of the 
other techniques. In this chapter, we do not limit techniques 
to those that are evidence-based because too many of the 
techniques commonly used do not have class I or II outcomes 
and would have to be omitted. Rather, we provide an 
overview of the major techniques and methods described by 
experienced clinicians providing their expert opinion (class 
III). Some of these methods also have case reports but few 
have randomized controlled clinical trials. 


Auditory Training 


Voice lacks finite acoustic boundaries. A mildly defective 
voice can become incorporated into the self without much 
notice, especially in children, whose concerns over its social 
acceptability are minimal. Even in severe voice disorders, the 
person may be upset over the voice but has long forgotten 
the sound of the normal voice and, therefore, any notion of 
how it ought to sound again. 

For these reasons, direct or indirect auditory training, 
defined as teaching the identification of and discrimination 
among different voices, is part of most voice therapy. Audi- 
tory training may simply be heightening a patient’s aware- 
ness of differences between normal and defective voices 
without directly discussing the differences. It can be indirect 
auditory feedback indicating when a patient hits or falls short 
of the target voice. In direct auditory training, the underlying 
assumption is that before a better voice can be achieved, the 
person has to know how his or her voice sounds. This 
assumes that patients of all ages need to hear the differences 
between normal and defective voices before they can pro- 
duce a normal voice. After comparing other voices with their 
own, they discuss those differences with the clinician. In the 
not too distant past, ear training was the initial prescribed 
phase of voice therapy. Ear training included listening to spe- 
cial sounds such as clock ticking and car horns, tapping or 
clapping in imitation of a sound pattern, differentiating 
between sound pairs of /m-n-n-n-m/ versus /n-n-n-m-n-n-n/, 
comparing doorbells with car horns or telephone rings, and 
listening to “good” and “bad” voice. This still may be useful in 
helping direct children’s attention to sound production but 
generally is not necessary in working with adults. 

Whether auditory training is direct or indirect, of major 
importance is instantaneous auditory feedback. What is sin- 
gular about voice therapy is that in the early, critical stages, 
improved voice will break through suddenly and momentar- 
ily, milliseconds in duration; although the clinician may hear 
and identify these gains, the patient usually does not. Conse- 
quently, the clinician needs to listen carefully and, when the 
voice changes, for the better or worse, communicate that 
change instantaneously to the patient. Initially, this instant 
feedback (“That’s it,” “Too high,” “Too low”) may come with 
each change or attempt to hit a target voice. Once the patient 
understands the target and can detect the change, the clini- 
cian’s feedback is quickly faded. 


Respiratory Control 


If expiratory air volume is patently insufficient because of 
neurologic weakness or incoordination, vocal loudness and 


steadiness will suffer. In such cases, improving a patient’s 
ability to produce a constant pressure by increasing lung 
volumes available for voice and control of smoother exhala- 
tion may be desirable ends. However, often for voice 
patients, respiration is anatomically and physiologically 
normal. Abnormal breathing patterns may be due to poor 
habits or posture, anxiety and tension, paradoxical vocal 
fold dysfunction, or poor coordination or lack of respiratory 
support that is often related to neurologic disease. These 
abnormal breathing patterns frequently result in failing to 
provide sufficient breath support for optimum voice. 
Abdominal breathing, often misrepresented as diaphrag- 
matic breathing, affords the greatest lung volumes. Cooper 
and Cooper (1977) enhance abdominal awareness and 
breathing coordination to encourage maximum support 
giving the following advice. 
Lying supine, the patient is asked to: 


+ Place one hand on the chest and the other on the 
abdomen. 


+ Inhale easily through the nose and exhale through the 
mouth, concentrating on inflating and deflating the 
abdomen but not the chest. 


+ Inhale quickly through the mouth only and exhale gradu- 
ally, using the same abdominal breathing method just 
described. 


Sitting, and then standing, the patient is asked to repeat 
numbers (1) to (3), incorporating the abdominal breathing 
into phonation at proper pitch and loudness levels. 

Gravitational forces and consequent work of the abdomi- 
nal muscles are different for the supine posture compared 
with sitting and standing (Hoit, 1995). Thus, the supine posi- 
tion because of the work against the weight of the abdominal 
contents enhances awareness, and the upright postures help 
transfer the breathing patterns to phonation. Some clinicians 
take the supine step a bit further by placing a book on the 
abdomen and/or by using a mirror or online video recording 
to aid the patient in observing his or her abdominal move- 
ments. 

For development of abdominal-intercostal breathing coor- 
dinated with phonation, Greene (1972) advocates the fol- 
lowing method: 


¢ Breathe in slowly and then out, counting in a quiet voice 
up to 4 and then gradually increasing the count at a rate of 
one per second. No breath should escape between each 
count. 


Inhale deeply, and, keeping the ribs elevated, count to 15, 
gradually letting the ribs descend between the count of 
15 and 20. 


+ Inhale and, on exhalation, sustain /s/ or /f/ steadily, trying 
not to let it fluctuate or fade toward the end. Feel the 
gradual contraction of the abdominal wall as breath is 
exhaled. 


+ Repeat this, except with a crescendo and diminuendo or 
in different rhythmic patterns. 
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For reinforcing the proper respiratory rhythm for speech, 
she instructs the patient to: 


+ Breathe in and out slowly several times and then imitate 
the clinician’s rapid intake and slow exhalation. The 
abdomen should jump forward on inhalation and subside 
gradually on exhalation. 


¢ Inhale quickly and exhale slowly, counting to 6. Gradually 
increase the count on exhalation to 20. 


Accent Method 


The Accent Method, developed by Svend Smith of Denmark 
more than 50 years ago (Dalhoff and Kitzing, 1989; Smith 
and Thyme, 1976; Thyme-Frokjaer and Frokjaer, 1987, 
Thyme-Frokjaer, 1998) is a holistic method that relies heav- 
ily on changing abdominal support for voice and incorpo- 
rates paced rhythmic movements into new motor speech 
breathing patterns. Based on aerodynamic principles of 
phonation, Smith incorporated abdominal accents and open 
airway into speech breathing to facilitate easy clear phona- 
tion with appropriate resonance balance. This programmatic 
method refocuses voice production from the larynx to the 
abdomen and promotes free open flow of air through the 
vocal tract. Several studies have documented the changes 
that take place as a result of this treatment program. Thyme- 
Frokjaer and Frokjaer have documented that breathy 
untrained voices achieve a longer glottal closure and creaky 
untrained voices achieve a shorter glottal closure suggesting 
that phonatory function has been normalized. They have 
also demonstrated acoustic changes: energy is increased in 
the F2 and F3 regions; irregular pitch disturbances are 
reduced; and variations in voice timbre are increased. Simi- 
larly, Kotby, Shiromoto, and Hirano (1993) have documented 
increased airflow, reduction in lesions, and improved voice 
in individuals with functional voice disorders and in individ- 
uals with mass lesions. Bassiouny (1998) demonstrated sig- 
nificantly better results for voice-disordered patients who 
had Accent therapy combined with vocal hygiene than for 
those who had vocal hygiene alone. She attributes the better 
intervention results to the holistic approach of the Accent 
Method. Verdolini (1998) suggests its effectiveness may be 
related to training new motor patterns. She reasons that 
because timing is fundamental to the development of skilled 
behavior, the Accent Method’s emphasis on timing and 
rhythmic movements of the body may facilitate new motor 
programs. 

There are four major stages in the Accent technique, which 
is also coupled with individually tailored vocal hygiene pro- 
grams. The training stages are supine; sitting; standing; and 
carry-over. The training stages include accentuated phona- 
tory pulsations on both voiced and voiceless sounds and 
reading of special texts that help facilitate patient awareness 
of differences between accentuated and unaccentuated con- 
texts. As originally conceived, the program took 30 or more 
sessions, which is generally impractical for functional voice 
disorders being treated in today’s clinics located in the 
United States and has been modified accordingly. The respi- 
ratory training is designed to produce a dominant abdominal 


respiration pattern. Exercises of accentuated phonatory pul- 
sations, often done in response to drumbeats, are used to 
help the patient become aware of accent patterns. Like the 
respiratory training of Cooper and Cooper (1977), the Accent 
treatment program begins in the supine position with hands 
placed on the abdomen. The patient is asked to take in a deep 
breath and breathe out slowly against slightly pursed lips in 
imitation of the clinician producing distinct rhythmic 
accented breath pulses. The clinician initially uses hand 
motions to direct the patient when air should be flowing into 
or out of the body simulating the pattern modeled by the cli- 
nician. Both length and speed of exhalations are varied 
between 2 and 6 seconds. Once the patient has accom- 
plished this slow inhalation/exhalation breathing pattern, 
slow accented beats are added. With each beat, the patient 
gives a punch, or stress, to the exhaled air using the abdomi- 
nal muscles. No sound is added until the patient can stress to 
the beat on voiceless exhalation. This rhythmic movement is 
repeated using /s/ /S/ /f/. The beats may be changed from the 
slow and broad largo to fairly slow andante and then the 
more rapid allegro. The next step returns back to the slower 
beat and adds voiced continuants /z/ /dz/ /v/ produced ina 
light rhythmic manner. This sequence is then repeated in a 
sitting position. One of the patient’s hands is placed on the 
chest and the other on the abdomen to continue making him 
or her aware of the role of the abdominal muscles and chest- 
wall movement patterns. Nonsense syllables and inflection 
patterns are gradually added. The training sequence is 
repeated with the clinician and patient standing and adding 
body movements swaying to and fro and slighting swinging 
relaxed arms. If at any point the patient fails, the clinician 
goes back to earlier successful steps before proceeding. Up 
until the point the new breath support pattern is to be incor- 
porated into conversation, the patient’s task is always to imi- 
tate the clinician to the beat presented. Clinicians who favor 
this technique generally use bongo drums to provide the tar- 
get beats. To learn this technique, clinicians generally need 
to practice with an experienced clinician or audio or video 
samples (Khidr, 2005; Khidr, Lantz, and Senatore, 2005; 
Kotby, 1991; Kotby, Shiromoto, and Hirano, 1993; Thyme- 
Frokjaer and Frokjaer, 2001, Kotby, 1995). 

Davenport and Sapienza (2004) suggest that the term 
breathing exercises encompasses a broad spectrum of treat- 
ments, ranging from relaxation exercises to music therapy, 
which may be taught in a mystic manner with few physio- 
logic guidelines. They state that although the validity of 
these exercises has not been verified, the successful tech- 
niques “may work by facilitating production of the voice at 
higher lung volumes or by taking advantage of the appropri- 
ate gravitational effects that direct breathing in an expira- 
tory direction.” 

After many years of experience administering voice ther- 
apy and attempting to deal with problems of respiration that 
often accompany voice disorders, Boone (1988) has taken a 
more conservative stand on its relative importance in voice 
therapy, although he continues to believe in the importance 
of modifying respiration in voice disorders. For the patient 
who demonstrates faulty respiratory patterns, he believes 
that some modification of respiration for speech is impor- 
tant, using feedback and patient education about optimum 
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respiratory actions. However, over the years, he has observed 
that many patients develop overconcern about how they are 
breathing, and their attention to the breathing process has 
actually interfered with a smooth and easy respiratory style, 
often increasing their laryngeal tension. 

Our thesis might well be that we best alter respiratory pat- 
terns by working to extend expiratory control, matching tar- 
get models, and learning how effortlessly to renew breath. 
The best technique for altering respiratory patterns, or the 
degree to which this is necessary to bring about best voice, 
has not been determined. 


Relaxation 


Relaxation as an adjunct to voice therapy is commonly advo- 
cated to combat excessive musculoskeletal tension. A 
plethora of descriptions of voice therapy techniques suggest 
that a more relaxed means of phonation is a primary goal. 
Relaxation as a primary or augmentative technique is exem- 
plified by the progressive relaxation techniques of Jacobson 
(1938, 1964, 1976) and McClosky (1977). These techniques 
try to teach recognition of the difference between feelings 
associated with muscular tension and those associated with 
relaxation by having the patient alternately contract and 
relax head, neck, thoracic, and abdominal muscles. Biofeed- 
back methods have also achieved relaxation, with the 
patient auditorily and visually monitoring the degree of 
muscle tension of the face and neck. 

Another technique designed specifically to facilitate relax- 
ation of the laryngeal, pharyngeal, and oral structures is 
chewing. The chewing method of speech muscle relaxation 
(Froeschels, 1952) is based on the theory that chewing, being 
a primitive, reflex-like, semiautomatic function, relaxes the 
lingual, mandibular, hyoid, and laryngeal musculature. The 
method advocates phonation while performing chewing or 
munching movements with the tongue and jaw. The pro- 
gression of steps includes heightening awareness of chewing 
movements while chewing on a bland cookie or cracker, 
chewing silently like a savage with exaggerated mouth and 
tongue movements, adding vowels while chewing, speaking 
nonsense words while chewing, speaking short meaningful 
phrases while chewing, and finally carrying the relaxed, 
improved movements into conversation. This chewing pro- 
gression often involves mirror work to help emphasize the 
importance of increased orality. Its value may be limited in 
patients who are embarrassed by the unusual nature of this 
therapy. 


Yawn-Sigh 


Boone and McFarlane (1993) advocate using yawning to 
open the inlet to the larynx and promote relaxation of the 
pharynx. The technique is simple to execute. Clinicians sim- 
ply instruct the patient to yawn and then exhale with a long, 
breathy sigh. This technique helps patients contrast a hyper- 
functional constricted airway with one that is more open. 
Xu, Ikeda, and Komiyama (1991) used respiratory kinematic 
sensors to provide biofeedback of a yawning breath pattern 
(YBP) to 91 voice-disordered patients. They reported signifi- 
cantly better results on voice tests and subjective evalua- 


tions for those patients who had the ability to master the 
YBP. Recent research has demonstrated that yawning has a 
physiologic basis that promotes relaxation. Yawning is 
induced by dopamine receptor agonists. 


Tone Focus 


Deeply ingrained in the history of therapy for voice disorders 
is the concept of optimum pitch (i.e., that there is a pitch 
level best suited to the supraglottic resonators that produces 
the most resonant voice with the least physical effort), 
located approximately one-fourth of the total pitch range 
from the bottom (Fairbanks, 1960). Whereas there is little 
doubt that balance of resonance and range are disturbed in 
virtually all voice disorders, it is not necessary to make a 
point of finding a correct pitch level and purposefully teach- 
ing it to the individual. In fact, one might argue that clini- 
cians would be misguided to search for a single optimal pitch 
and rather should work to produce a tone focus that is a bal- 
ance of resonance across the patient’s pitch range. Hence, 
the speaking level best suited to the supraglottic resonators 
that produces the most resonant voice with the least physi- 
cal effort will relate to the size and shape of the vocal instru- 
ment and the power provided by the respiratory driving 
force. Each person who has a nonorganic voice disorder has 
the inherent capacity to phonate with a proper tone focus, 
but the muscular tensions underlying the disorder have 
pulled the larynx out of optimum position. When the mus- 
culature has returned to its normal state, the correct pitch 
usually follows often, though not always, accompanied by a 
natural balance of resonance. When the voice disorder is one 
of incorrect pitch and resonance, the clinician can roughly 
demonstrate both how far off the patient is and the extra 
effort required to produce voice, by helping the patient find 
the best pitch level that is physically easiest to produce. The 
“um-hum” method proposed by Cooper (1973) is ideal for 
this purpose. 

The patient is asked to say “um-hum” using rising inflec- 
tion with the lips closed, as though he or she were sponta- 
neously and sincerely agreeing with what was just said. It is 
vital to underscore the fact that this “um-hum” be sponta- 
neous and sincere. A natural “um-hum,” which is easy and 
gentle, will be felt around the sides and lower portion of the 
nose and around the lips. To help identify the placement of 
tone focus, the fingers of one hand should be placed lightly 
on the bridge and sides of the nose while the other hand is 
placed lightly on the throat. As one says the “um-hum,” there 
should be a slight vibration or tingle in the mask area. 


Resonant Voice Therapy 


In their classic review of voice therapy techniques, Verdolini, 
Devore, and Ostrem (1998) state “. . . that there are probably 
many different ways to produce ‘good voice.” They include 
“forward focus” or “resonant voice” as one of the successful 
approaches. Based on work described by Arthur Lessac, Ver- 
dolini developed a programmatic approach to resonant 
voice. The program is based on the knowledge that when 
resonant voice is produced, the vocal folds barely touch 
while producing a strong, clear voice requiring the least 
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amount of lung pressure resulting in low-impact stress on 
the vocal fold tissues (Verdolini, Druker, Palmer, et al, 1998). 
Verdolini’s systematic 8-week program teaches the reso- 
nant voice gesture by placing emphasis on sensory informa- 
tion and exploration. This is based on mind-brain models of 
skill acquisition that suggest feeling, hearing, and seeing are 
more important in learning new skills than are explanations 
of the physiology of production. Rammage (1997) developed 
patient practice handouts on resonant voice therapy to sup- 
plement treatment. The 10-step instruction summary helps 
illustrate the systematic approach to this technique. 


1. “Sit as tall as you can with your head resting comfort- 
ably on top of your spine. You should be able to move 
your head easily from side to side. Take care not to 
slouch or raise your chin in the air. Once seated com- 
fortably with your arms resting at your sides and your 
shoulders relaxed, take a few deep breaths. When 
exhaling, maintain a continuous breathstream. If you 
feel any resistance in your chest or neck when exhaling, 
take a few more breaths until you are exhaling freely. 
You are now ready to begin to explore your resonant 
voice.” 


2. “On a light tone that feels comfortable to you, hum at a 
comfortable pitch to begin feeling ‘buzzing’ or vibratory 
sensations in your head bones and facial tissue. Your lips 
should be closed lightly and your neck, throat, shoulders, 
and jaw should continue to feel free. Repeat the hum 
several times, noticing where you feel ‘buzzing.’ The 
buzzing sensation that you feel is resonance.” 


3. Patient sustains a hum. “Hmmmmmmmmmmmmmm- 
mmmmmmmnm...” 


4. “Where do you feel the buzz? In the lips, nose, sinuses, 
forehead, top of your head, hard palate, teeth, else- 
where?” 


5. “After noticing where you feel the buzzing, or resonance, 
vary the pitch on which you are humming. Hum on a 
sigh and slide down several pitches, keeping your neck, 
shoulders, and jaw free and relaxed.” 


6. “Hum while doing a siren, varying the pitch up and then 
down.” 


7. “Do you notice that some pitches feel more buzzy than 
others? Repeat the siren several times and pay attention 
to the most resonant, or loudest, pitch. If you find a pitch 
or tone that feels particularly effortless and powerful, 
hold that pitch for a moment and then say the word 
‘man.’ Feel the resonant buzz throughout the word.” 


8. “Hum.” Hmmmmmmmmmmmmmmmn.... 


9. “Let your pitch vary naturally. Focus on the feeling of res- 
onance as you say the entire word or phrase and allow 
the pitch to vary naturally. The intensity of the resonance 
or buzzing will vary with pitch. The following words, 
phrases, and sentences were chosen because they con- 
tain the most resonant consonants that we produce, ‘m’ 
and ‘n.” Here patients are provided a list of 50 to 60 
words such as mime, moan, running. 


10. “Feel the resonance through the final consonant. Your 
throat, jaw, shoulders, and neck should feel free and 
relaxed while you are using resonant voice.” Patients are 
then given short phrases, such as “more money” and 
“home team,” to practice. This is followed by lists of 
longer sentences such as “Many men were mining” and 
“Come with me and have some of mom’s marvelous 
homemade jam” and “Mark and Mary may be married in 
the end of May.” Additionally, Lessac’s “Ybuzz” sounds 
are practiced in syllables, words, and sentences. These 
are sounds that incorporate frontal lip and tongue place- 
ments for production of /b/ /f/ /1/ /p/ Isl. 


It is interesting to note that humming may have additional 
health benefits particularly to the extent that sinus infec- 
tions are thought to be a contributory factor to voice problems. 
Weitzberg and Lundberg (2002) reported that production of 
nitric oxide (NO) increased 15-fold during humming com- 
pared with quiet exhalation. These researchers suggested 
that NO helps blood vessels dilate, thereby allowing oxygen- 
carrying blood to flow more freely. Humming appeared to 
facilitate exchange of air from the sinuses to the nasal pas- 
sages, which may, in turn, help ventilate the sinuses, protect- 
ing them from developing infections. 


Lip Trills 


An alternate method to achieve resonance balance is the use 
of trills. Lip and tongue trills are used to provide the patient 
a feeling of proper tone focus. Patients are asked to glide up 
and down their pitch range while doing either lip or tongue 
trills. During the trill, the aerodynamic balance within the 
vocal tract is optimal and there can be no strain on the laryn- 
geal apparatus. Patients who initially have difficulty produc- 
ing trills often find placing their hands gently against their 
cheeks facilitates the production of lip trills or lip bubbles. 


Trial and Error 


It may seem paradoxical, but despite established principles 
and techniques of voice therapy, trial and error is an essen- 
tial methodology. Far from being a science, voice therapy is 
pragmatic and demands that the clinician take advantage of 
accidental, unplanned voice improvements during therapy. 
Boone (1977) states this principle clearly when he says that 
the clinician must continually “search for the patient’s best 
and most appropriate voice production. This searching is 
necessary because so much of our vocal behaviors are highly 
automatic. The patient cannot volitionally break vocalization 
down into various components and then hope to combine 
them into some ideal phonation. Our therapy techniques are 
primarily vehicles of facilitation; that is, we try a particular 
therapy approach to probe and see if it facilitates the pro- 
duction of a better voice. If it does, then we utilize it as ther- 
apy practice material. If it does not, we quickly abandon it. As 
part of every clinical session, we must probe and search for 
the patient’s best voice. We use it as the target model in 
therapy.” 

The question then is not whether we should use probe 
therapy but how we use trial and error to elicit the target 
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voice. How long do we try a facilitating technique before 
abandoning it and moving on to another? How much of each 
therapy session should we probe to search for the patient’s 
best voice? 

There is no single answer to these questions. With experi- 
ence, clinicians learn to listen to the voice presented by the 
patient during the first few minutes of an encounter to make 
judgments about the origin of the disordered voice. During 
the initial diagnostic session, probe therapy is used to see if a 
“better” voice can be produced by increasing or decreasing 
closure, loudness, pitch, resonance, or breath support. It may, 
or may not, involve use of visual feedback of the acoustic, 
aerodynamic, or visual image of some part of the vocal tract. 
Time is usually limited to no more than 15 minutes. Facilitat- 
ing techniques used are generally those that require little 
explanation and have the highest probability of making 
change. For example, with a patient presenting with a weak 
breathy voice, techniques directed at increasing closure or 
breath support are the most likely to facilitate change. Using 
vegetative initiators like a cough or laugh, tried three to four 
times, is often enough to provide the clinician with a target 
model of a better voice or with knowledge that an alternative 
treatment may be indicated. Similarly, with a patient pre- 
senting with a loud strained voice, three to four imitations of 
the clinician sighing, or using a breathy voice, might be suffi- 
cient to determine the initial target. In subsequent session(s), 
as the patient’s voice improves, the target of “best voice pos- 
sible” often changes and is determined by perceived vocal 
differences, or lack thereof, in therapy. As in most behavioral 
changes, movement toward achieving the target is rarely a 
straight line. The patient gets better, regresses, and gets bet- 
ter again. When the patient becomes stuck or discouraged 
with progress using a particular technique, the clinician often 
uses trial and error with other techniques to help select a dif- 
ferent technique to move the patient forward. A few basic 
principles may help the beginning clinician in using trial and 
error searching for a target voice: 


¢ Trial and error should not be used as an excuse for a shot- 
gun approach. Clinicians should be systematic in their 
approach and selective in techniques used taking care to 
match treatment and problem. 


¢ Trial-and-error therapy should not be the major focus of 
every treatment session. Once a treatment plan is devel- 
oped, trial and error are used to readjust targets and 
determine if a new technique could return the voice to 
greater heights. 


¢ Trial-and-error failure at one point in therapy does not 
mean the technique would not work in the future. Trial- 
and-error success does not mean that it is the only tech- 
nique that would work for an individual patient. Voice is 
dynamic: as the patient gains voice, techniques that did 
not work well in the beginning may later become excel- 
lent facilitators. 


¢ Efficient application of trial-and-error therapy requires 
clinicians to understand the physiology of voice produc- 
tion. Clinicians select techniques to try based on their 
understanding of the physiology being used to produce 
the aberrant voice. 


+ To help improve efficiency of practice, clinicians should 
model trial-and-error tasks before requiring the patient 
to produce them. 


+ Although both use the same facilitating techniques to 
determine vocal status, trial-and-error therapy should 
not be confused with the use of prognostic probes. In 
therapy, the technique is used until it is no longer func- 
tional. When the same technique is used as a prognostic 
probe, the technique is used briefly to predict if and how 
the voice might improve further. 


This suggests that all direct voice therapy could be consid- 
ered a facilitating technique. In their book on voice therapy, 
Boone, McFarlane, and VonBerg (2005) table facilitating 
approaches alphabetically and the parameters (pitch, loud- 
ness, and quality) of voice they affect. Because these are the 
perceptual parameters clinicians use to judge the effective- 
ness of a technique, it makes good sense to list them this way 
though the parameters are not independent, and often 
changing one parameter will affect one or several other fac- 
tors. Another way to look at these techniques would be by 
the physiologic parameters of vocal fold vibration they 
change. Rattenbury, Carding, and Finn (2004) use a different 
schema for listing voice therapy techniques. They divide 
them into indirect voice therapy techniques, direct tech- 
niques, and techniques used as prognostic probes. They base 
their list of prognostic probes on the Morrison and Rammage 
(1993) classification of voice disorders and muscle misuse. 
This physiologic schema classifies techniques based on 
whether they affect isometric closure pattern, glottic con- 
traction, supraglottic (lateral) contraction, anterior-posterior 
contraction, or total incomplete adduction. This, too, makes 
sense, particularly when clinicians have means to visualize 
the larynx, because not all voice problems are resolved even 
when effort level or vibratory patterns are improved. For 
example, if vocal fold closure were judged to be excessive, 
techniques of confidential voice, digital manipulation or 
laryngeal massage, relaxation, visual feedback, and yawn- 
sigh might be tried. Conversely, probes for inadequate 
approximation might include increased or effortful loud- 
ness, respiration training, pushing, pitch glides, and visual 
feedback. The advantage of the latter approach is that clini- 
cians can be more precise in their selection of a facilitating 
technique and hence more efficient. For example, not all 
problems of vocal loudness emanate from the same etiology. 
Probes for loudness problems caused by inadequate closure 
would be different than probes selected for loudness prob- 
lems caused by poor breath support, by laryngeal scarring, or 
by poor motor control. Thus, the clinician needs to know the 
underlying physiologic effects of each technique to select 
and apply them in a meaningful manner to determine target 
models, plateaus, and time to select additional techniques or 
terminate therapy. 


Voice Rest 


Voice rest is an indirect voice therapy technique calling for 
reduction or abstinence of vocal use. It is based on the 
assumption that the absence of vocal fold impact stress 
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promotes tissue healing, reduces scar formation, enhances 
healing in the injured larynx, and reduces vocal fatigue. 
Based on these assumptions, physicians may prescribe 
voice rest as part of postoperative care, as a preventative 
measure to reduce vocal fatigue, in the case of vocal fold 
hemorrhage to prevent further bleeding, and in benign 
lesions to reduce the size of the lesion. Like most other 
voice therapy techniques, there is little evidence either to 
support or refute these assumptions nor is there much con- 
sensus on either its necessity or efficacy. Nevertheless, any 
treatise on vocal treatment would be incomplete without a 
thorough discussion of the theory and role of vocal rest in 
treatment of dysphonias and the controversies surrounding 
its use. 

In the past decade, research on changes in the microstruc- 
ture of the larynx in canines after tissue trauma has led to 
theories about what happens to the healing structure with 
microtrauma, which in turn has led to recommendations for 
voice rest after phonosurgery. In theory, any interruption of 
microcirculation in the vocal folds leads to temporary 
ischemia, disruption, and increased permeability of capillar- 
ies and ultimately to reorganization of the affected structure 
and fibrous mass or edema. Gray Pignatori, and Harding 
(1994) demonstrated some of these organizational changes 
in the basement membrane zone (BMZ) and the superficial 
layer of the lamina propria. The histoarchitecture of the BMZ 
became disrupted, and the structural glycoproteins and 
fibrous proteins normally seen in the speech language 
pathologist (SLP) were nearly absent. Durkin, Duncavage, 
Toohill, et al (1986) injured the vocal folds of eight canines 
and examined the folds at 1, 2, 3, 5, 7, 10, and 32 days post- 
operatively to follow the healing process. They observed 
edema and tissue necrosis at all time points. These findings 
were augmented by a Korean study adding an additional 
twist on the influence of phonation on basement membrane 
zone recovery after phonomicrosurgery (Cho, Kim, Lee, et al, 
2000). These investigators used 20 dogs placing 10 in a situ- 
ation to simulate iatrogenic voice rest and 10 allowed to 
phonate normally. Healing process was observed histologi- 
cally at 1, 2, 4, 8, and 12 weeks after surgery. In the voice rest 
group, the basement membrane was completely re-formed 
in 2 weeks, and the “cover” appeared completely rearranged 
by 8 weeks. In contrast, the phonation group had delayed heal- 
ing and basement membrane changes. Based on these 
canine results, the authors suggested that voice rest of at 
least 2 weeks after surgery may be beneficial and that vocal 
hygiene should be maintained for as long as 8 weeks. How- 
ever, it has been long known that the canine larynx is not 
identical to that of humans (Jako, 1972). Jako was one of the 
first to note that compared with humans, the canine has only 
a rudimentary vocal ligament and a thicker lamina propria, 
though it does have a similar basement membrane. Thus, cli- 
nicians must be cautious in making direct extrapolations 
from canines to humans. 

Whereas systematic histologic studies of the daily change 
in the ultrastructure of the vocal fold cannot be observed in 
humans, the quality of the voice and endoscopic image can 
be sequentially observed after phonosurgical treatment. In 
one such study, Wang and Huang (1994) studied 25 patients 
placed in a sequence of three postoperative programs (voice 


rest, vocal hygiene, voice treatment) and concluded that 
voice rest appeared to promote healing and should be used 
as part of a holistic voice treatment plan after phonosurgery. 
Ye, Yang, Zhao, et al (2002) were also interested in the role of 
voice rest in tissue healing postoperatively. They approached 
the problem by following the voice recovery of 20 patients 
who had polypectomies. Daily voice assessments revealed 
the worst voice occurred in day 1, recovered significantly by 
day 10, and reached normal by day 14. They concluded from 
these observations that after polyp removal, patients should 
be placed on 14 days of vocal rest. 

In combination, these experimental time-series studies 
appear to provide some evidence of the effectiveness of 
voice rest, yet it is unclear the frequency with which clini- 
cians employ voice rest in treatment. In a prospective study 
attempting to determine if voice rest was the standard of 
care after excision of benign mucosal vocal fold lesions, 
Behrman and Sulica (2003) mailed a 16-item survey to all 
active members of the American Academy of Otolaryngology 
(n = 7321). The majority of the 1208 respondents favored 
some form of postsurgical vocal rest for 7 days, but only 18% 
of them recommended complete vocal rest, and 15% did not 
recommend any type of voice rest. 

A recent vocal advocate of voice rest, Gwen Korovin argues 
a strong case for including voice rest as part of phonosurgical 
care but does not limit it to postoperative care and recog- 
nizes voice rest is not the only contributing factor to vocal 
fold healing. In her comprehensive discussion of issues 
related to voice rest, Korovin’s recommendations for nonop- 
erative vocal rest include acute edema, prenodules, hemor- 
rhage, mucosal tears, and ulcerations. Similarly, Sataloff and 
Hawkshaw (2006) state that, 


Voice rest (absolute or relative) is an important therapeu- 
tic consideration in any case of laryngitis. When no pro- 
fessional commitments are pending, a short course of 
absolute voice rest may be considered, as it is the safest 
and most conservative therapeutic intervention. This 
means absolute silence and communication with a writ- 
ing pad. The patient must be instructed not to whisper, as 
this may be an even more traumatic vocal activity than 
speaking softly. Whistling through the lips also involves 
vocal fold activity and should not be permitted. So does 
the playing of many musical wind instruments. Absolute 
voice rest is necessary only for serious vocal fold injury 
such as hemorrhage or mucosal disruption. Even then, it is 
virtually never indicated for more than 7 to 10 days. Three 
days are often sufficient. 


Korovin’s advice for voice rest associated with phono- 
surgery depends on the site and extent of the vocal fold 
lesion, the type of corrective surgery, and is often combined 
with behavioral management. This combination of treat- 
ment is also advocated by Murry (2001) and Behlau (2005) 
particularly in cases at greater postoperative risk for healing 
problems such as patients who are smokers with polypoid 
corditis or patients who have had significant tissue resec- 
tions. 

The vocal rest program elucidated by Korovin is a four-stage 
postoperative treatment (preoperative, initial postoperative, 
first postoperative exam, and subsequent postoperative). 
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During the preoperative period, the patient is provided edu- 
cation about the medical condition and voice production 
and provided medical management for problems contribut- 
ing to tissue irritation. Voice therapy is focused on minimiz- 
ing the associated edema. The initial postoperative period, 
which immediately follows surgery, includes voice rest, 
education about subsequent stages, support systems, and 
preparation for easy voicing. The first postoperative exam is 
completed at 1 week. The larynx is visualized to determine 
the patient’s ability to transition from voice rest to voice 
conservation and the subsequent postoperative treatment. 
The patient needs to understand the importance of voice 
conservation for the initial critical healing period. The 
extent of voice conservation varies by a patient’s voice 
needs, ability to comply with recommendations (i.e., spe- 
cific time limits recommended for vocal usage vs. good 
judgment), and the degree and type of surgery. Subsequent 
postoperative treatment includes practice of vocal hygiene: 
hydration and humidity, mucolytics, antitussives, and tun- 
ing in to listen to body and voice. Postoperative vocal 
hygiene also includes avoiding smoke, throat clearing, 
coughing, reflux, and allergens. Korovin (2005) suggests cli- 
nicians consider perioperative medications that may affect 
healing (steroids, antireflux medications, antibiotics, anti- 
fungals, oral inhalers, hormone therapy, and blood thin- 
ners). Other factors influencing postoperative voice rest 
recommendations include the availability of an experienced 
voice care team, extent of preoperative voice therapy, com- 
pliance with vocal hygiene, primary or secondary surgery, 
and appearance and speed of healing. During periods of 
absolute vocal rest, patients are instructed to use pen and 
paper, children’s write/erase boards, unvoiced noises such 
as hand clapping, finger snapping, hissing, and e-mail and 
cell phone text messaging. For the period of voice rest, any 
unnecessary behavior that has the potential to approximate 
the vocal folds is to be avoided. Patients are instructed that 
they should not whisper, mouth words (because they might 
subvocalize), play wind instruments, whistle, or lift heavy 
objects. An equally important part of the vocal rest is to 
avoid crying because it causes increased subglottal pressure 
and upper thoracic tension, rapid vocal fold closure, and 
prolonged closed phase. During voice conservation behav- 
ioral therapy instruction on use of confidential and resonant 
voice, vocal function exercises and/or the Accent technique 
are often employed. Korovin argues against concerns about 
voice rest resulting in muscle atrophy. Though cognizant 
that opinions on this topic vary, she makes a case that 
wound-healing studies indicate 1 to 2 weeks is appropriate 
for the majority of remucosalization to occur, and that there 
are no studies demonstrating atrophy after 2 weeks of 
voice rest. She also addresses the notion that voice rest has 
negative psychologic ramifications. Again, with little data it 
is difficult to discern the psychologic implications because 
there is so much variance between patients. She argues that 
in select patients, voice rest may induce feelings of insecu- 
rity, helplessness, and dissociation from the verbal world, 
but with proper preoperative counseling/education, the 
feelings should be temporary. 

A perennial problem underlying voice rest recommenda- 
tions is different notions as to what constitutes “voice rest.” 


Behlau is very clear in her set of voice recommendations for 
patients after benign lesion surgery. When voice rest is rec- 
ommended, she tells patients, “... do not use your voice in 
any circumstance, don’t whisper or murmur, don’t cough 
with effort and sound, don’t clear your throat and don’t 
speak while doing activities such as defecation or sexual 
intercourse.” Although she does not describe specific 
stages, she suggests patients return to voice activities grad- 
ually, initially being restricted only to necessary situations. 
She further recommends that during the first 2 postopera- 
tive weeks, patients be instructed when returning to voice 
to avoid speaking on the phone, especially on cells; avoid 
noisy places; and to refrain from speaking loudly or for long 
periods. 

The need to incorporate vocal rest into treatment pro- 
grams is also supported by Hoover, Sataloff, Lyons, et al 
(2001). They suggest that although there are no scientific 
studies to confirm its efficacy, a brief period of complete 
voice rest is the standard of care and appears to be helpful in 
avoiding adverse sequelae and advancing the healing 
process in the case of mucosal tears. 

Murry and Rosen (2000) also support the use of voice rest. 
They successfully treated three cases of vocal fold hemor- 
rhage, secondary to crying, with voice rest and subsequent 
phonomicrosurgery for lesions. 


Summary of Voice Rest 


Thus, voice rest seems logical based on what we know 
about healing characteristics but remains controversial. 
Voice rest recommendations generally fall under one of 
three classifications: complete/absolute voice rest, rela- 
tive/modified voice rest, or voice conservation. Commonly 
complete voice rest advice given to patients with voice dis- 
orders is to refrain from using the voice; this means not 
talking at all including no whispering, humming, or throat 
clearing. Relative voice rest limits vocalizations in time, 
intensity, and frequency, often suggesting talking at a 
reduced volume, as if you were no more than an arm’s 
length away from the listener, and for no more than 5 min- 
utes/hour. Advocates of voice conservation advise patients 
to limit vocalizations to times when they must speak and 
otherwise conserve the voice using voice rest coupled with 
hydration. 

The duration of voice rest, modified or complete, recom- 
mended to patients can range from 24 hours to several 
weeks, and there is no accepted standard of care probably 
because of the lack of data and controversial issues associ- 
ated with extended periods of silence. 


+ Although some data suggest that 2 weeks of silence has 
therapeutic value, the advice is commonly ignored 
because of its impracticality. Moreover, there are conflict- 
ing studies in the literature suggesting vocal rest is of lit- 
tle value. Based on a study of 127 patients who had sur- 
gery for vocal nodules, polypoid degeneration, granuloma, 
cysts, polyps, leukoplakia, and carcinoma, Koufman and 
Blalock (1989) concluded that voice rest provided no 
greater protection against postoperative dysphonia than 
did an alternate program of voice conservation, that is, 
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soft glottal attack, loudness monitoring, and avoidance of 
abusive vocal patterns. 


¢ Some clinicians believe that absolute voice rest can do 
harm. It implants the idea in the patient’s psyche, espe- 
cially in those who have susceptibility to psychogenic 
voice disorders, that the larynx and voice are foci of weak- 
ness and that normally energetic voice use is damaging. 
This idea may persist beyond the duration of the pre- 
scribed voice rest and, unless reversed, can lead to anxiety 
and to a secondary voice disorder or respiratory inhibi- 
tion. To say it another way, when the recommendation of 
voice rest creates, perpetuates, or exacerbates a voice dis- 
order, as it often does, voice rest can be considered an 
iatrogenic! cause of a voice disorder. Overcautious voice 
use is often elected by patients even when not advised to 
rest their voices. In actuality, the opposite kind of voice 
use is desired in some patients with voice disorders, that 
is, rather than voice rest, they need to adopt firm, normally 
loud voices produced with equally firm exhalatory force. 


For some patients, an alternative to voice rest is use of 
confidential voice. Like voice rest, the technique is designed 
to protect the vocal folds by reducing impact stress. 
According to Colton and Casper (1990, 1995, 2005), confi- 
dential voice is using a quiet breathy voice as if you were 
speaking confidentially to a person no more than an arm’s 
length away. Clinician modeling a confidential voice and 
contrasting it with undesirable stage whispers is critical to 
success of this treatment. Clinicians using confidential 
voice therapy should make sure that the patient knows and 
can produce the differences and should limit the time the 
patient is placed on confidential voice to no longer than is 
necessary for the vocal injury to heal. Studies of the 
microstructure of the vocal fold suggest that 1 to 2 weeks 
should be adequate. 


Vocal Hygiene 


One of the most commonly used treatment regimens is 
vocal hygiene. Vocal hygiene focuses on reducing or elimi- 
nating behaviors that contribute to the cause or mainte- 
nance of voice problems and, by so doing, is thought to 
improve tissue characteristics and vocal fold vibration. It 
may include some form of voice rest and antireflux direc- 
tives (discussed elsewhere in this chapter) and specific 
lifestyle changes such as diet changes, not eating immedi- 
ately before retiring, avoiding speaking over noise, alter- 
nating talk time with quiet time, speaking within the 
middle of one’s pitch and loudness range so as to avoid 
phonatory extremes, and increasing hydration. Vocal 
hygiene programs are individually tailored to the habits 
and needs of the patient. Clinicians need to be cautious 
that they are not overly ambitious in program develop- 
ment. Too many demands for lifestyle change can be over- 
whelming and result in none of the directions being 
adhered to. 


Natrogenesis is defined as illness induced by unavoidable or 
improper diagnosis or therapy. 


+ Feedback Devices in Voice Therapy 


It cannot be overstressed how important a role feedback plays 
in voice therapy. Because there is minimal tactile and proprio- 
ceptive sensation arising from the larynx during phonation, 
most patients rely primarily on audition. Audition appears to 
be the primary channel through which voice is monitored, as 
proved by the congenitally deaf who are known for their 
severely aberrant voices, which are typically monopitched, 
poorly modulated in loudness, and abnormal in nasal reso- 
nance. When voice becomes defective, intensification of its 
parameters during therapy is needed. Audio recordings are 
probably the most common feedback device used by clini- 
cians to enable the patient to examine voice after it has been 
produced. However, delayed presentation of the voice may 
have limited benefit as it is often difficult for the patient to 
relate what he or she produced to what he of she hears 
moments later, and instantaneous feedback is also necessary. 
Amplification of the voice, feedback through earphones, is one 
means. Even simpler is cupping the hand around the ear or 
phonating while facing the corner of a room, where the voice 
rebounds off the walls. Feedback devices should be consid- 
ered as adjuncts to therapy but should not be substituted for 
the interpersonal and environmental techniques integral with 
the mechanical aspects of therapy. 

Many devices have been developed to enhance sensory 
feedback of voice production. One of the first specially devised 
instruments for vocal abuse disorders is the Voice Intensity 
Controller, which instantly warns the speaker when vocal 
intensity becomes excessive. Worn like a hearing aid, with the 
microphone attached to the base of the neck anteriorly and an 
earphone in the ear, the device can be set so that when the 
speaker’s voice exceeds a certain intensity level, a warning 
tone is triggered. Holbrook, Rolnick, and Bailey (1974) 
reported on the complete resolution of vocal nodules, polyps, 
and contact ulcers in 32 patients who used this device, the 
mean treatment time being 5.3 weeks. Sound spectrography 
provides visual feedback display of voice spectra, intensity, 
and duration, which can be effectively used as feedback in 
therapy and home training. Both clinicians and patients have 
easy access to this type of feedback through either commer- 
cially available units or freeware obtained over the Internet. 
We have already been introduced to these devices as clinical 
evaluation and research instruments, but they clearly serve 
dual purposes when they provide an instantaneous visual dis- 
play on a computer screen. Because of their sensitivity to 
noise in the voice spectrum, these devices are especially use- 
ful for the treatment of voice-quality disorders. 

The ability to visually monitor voice pitch or loudness is 
afforded now by a myriad of instruments. Typically, either a 
head- or neck-mounted microphone transduces the acoustic 
voice produced by the patient into a visual signal enabling 
the speaker to read the frequency or intensity. Often, this is 
in the form of needle deflection, vibration, digitized num- 
bers, or bars reflecting voltage change. Selected instruments 
provide storage of data useful to the clinicians monitoring 
treatment as well as direct feedback to the patient. 

For example, Cheyne, Hanson, Genereux, et al (2003) in 
concert with Kay-Pentax has developed a vocal accumulator 
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that monitors vocal load and provides individual patients 
with feedback about their frequency, intensity, and duration 
of phonation throughout the day. This portable device holds 
considerable promise in heightening patient awareness of 
excess vocal loads and voice-production problems, inform- 
ing patients about harmful vocal behaviors and assisting cli- 
nicians in helping patients get increased practice. 


Vocal Function Exercises 


Vocal function exercises comprise a treatment program 
designed by Stemple to build strength and endurance in the 
laryngeal muscles and in so doing improve range and control 
of voice production (Sabol, Lee, and Stemple, 1995; Stemple, 
Lee, D’Amico, et al, 1994). The exercise program is based on 
the notion that laryngeal muscles are skeletal and thus 
should respond to exercises similar to those used to improve 
limb control. Thus, exercises are organized into warm-up 
and cool-down, stretching and contraction, and power mod- 
ules. The exercises were first described by Briess (1959) and 
were based on his extensive work with laryngeal EMG. The 
program specifies that patients complete a prescribed series 
of nasalized vowel prolongations and pitch glides twice a 
day for several weeks. The patient is given a recording diary 
to keep track of the length of time he or she is able to sustain 
phonations, the number of trials practiced, and the number 
of tones produced. Patients going through this prescribed 
program seem to gain better control over airway valving and 
in so doing reduce hyperfunctional behaviors. 


Lee Silverman Voice Therapy 


The Lee Silverman Voice Therapy (LSVT) was initially devel- 
oped by Ramig and colleagues to improve the voice and speech 
of individuals with Parkinson’s disease (Ramig, Sapir, Fox, and 
Countryman, 2001; Sapir, Pawlas, Ramig, Countryman, O'Brien, 
Hoehn, and Thompson, 1999; Sapir, Pawlas, Ramig, Seele, Fox, 
and Corboy, 1999; Ramig and Dromey, 1996; Ramig, Country- 
man, Thompson, and Horii, 1995; Dromey, Ramig, and John- 
son, 1995; Ramig, Countryman, Thompson, Horii, 1995; Smith, 
Ramig, Dromey, Perez, abd Samandri, 1995; Countryman, 
Ramig, Pawlas, 1994; Mamig, Countryman, Thompson, and 
Horii, 1994; Ramig, 1993). Since the initial development, LSVT 
has also been successfully applied to individuals with other 
neurologic disorders, to geriatrics with presbyphonia and to 
individuals with Down syndrome. LSVT is an intensive treat- 
ment program that focuses on increasing loudness. Therapy 
sessions are held daily, 4 times per week, for 4 weeks, generally 
with certified LSVT trainers. In both the daily treatment and 
home practice sessions, patients attempt to recalibrate them- 
selves to speak louder without feeling they are yelling. Tasks 
focus on loud vowel prolongations at different pitch levels and 
production of increased loudness in short phrases building up 
to conversational speech. Patients are taught to “think loud.” 
Outcome measures of LSVT training have demonstrated that 
not only do patients get louder, but they also have improve- 
ment in pronunciation, inflection, and facial animation. Fol- 
low-up studies have also shown that as long as 12 months after 
completion of treatment, patients continue to speak with 
louder voices. This is impressive as many of the individuals 


have progressive neurologic disease. It has been postulated 
that one of the reasons this technique is effective is that it is 
simple and by focusing on a single parameter, the patient is 
able to improve control and coordination of the respiratory- 
phonatory apparatus. The developers of LSVT underscore the 
importance of clinician training, reinforcing target behaviors, 
and recalibrating patients as to what constitutes “normal” pro- 
duction levels. 


+ Reactive Anxiety 


In planning treatment, clinicians need to recognize the poten- 
tial for a serious dilemma in the diagnosis and treatment of 
voice disorders that can get in the way of helping patients and 
their ability to help themselves: the strong tendency for 
patients to become anxious and overprotective of their laryn- 
ges as a reaction to the diagnosis of organic or psychogenic 
voice disorder. The larynx is particularly vulnerable to emo- 
tional disequilibrium, mild or severe, transient or chronic. One 
reason for the oversensitivity of the larynx is its vital impor- 
tance in everyday life for intellectual and emotional expres- 
sion. We cannot appreciate the importance of a normal voice 
until the larynx undergoes structural or physiologic changes 
producing dysphonia or aphonia. The most stalwart person 
sooner or later will become frustrated, embarrassed, and 
hypersensitive to the obstacle that abnormal voice presents. 

A second reason for the patient’s vulnerability to overcon- 
cern and emotional reactivity to dysphonia is that the larynx 
is well known in the public mind as a site of malignancy, and 
the most common concern that patients express when they 
develop abnormal voice is that they may have cancer of the 
larynx. 

Thus, our problem, diagnostically and therapeutically, is 
that, despite our best efforts to prevent it, patients will gener- 
ate overanxiety about their larynges and voices. Any advice 
given to them about the need to avoid certain kinds of abusive 
vocal activities, or even simply the administration of voice 
therapy, reinforces the patient’s belief that the larynx has to 
be protected at all costs. This attitude easily translates into 
anxiety and its associated muscle tensions. The patient elects 
to phonate with less than normal expiratory effort, breathing 
shallowly and phonating weakly. That is why the laryngologist 
and the speech-language pathologist, who on the one hand 
need to give advice and therapy for voice improvement, on the 
other hand must neutralize the tendency toward the patient’s 
overconcern by not advising it too strenuously and by reassur- 
ing the patient that full voice use is desirable as soon as possi- 
ble. Particularly, we need to emphasize that the patient need 
not protect the larynx by using reduced loudness and respira- 
tory volumes. In some cases, the clinician does more for the 
patient by giving simple advice on proper voice instead of a 
long series of voice therapy sessions, which risk reinforce- 
ment of the patient’s apprehensions. 


Telemedicine 


A question raised by students and insurance companies con- 
cerns whether the voice-treatment objectives can be 
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obtained by anything other than individual treatment ses- 
sions, which are expensive and an inefficient use of clinician 
time. Efficacy studies comparing group versus individual 
voice therapy outcomes have not been completed. Mashima, 
Birkmire-Peters, Holtel, et al (1999) initiated a novel project 
to investigate the potential of remediating voice disorders 
remotely using telemedicine. With computer technology 
and Web cams, this type of treatment becomes an increasing 
reality and may be particularly beneficial for patients living 
in rural areas or patients who have difficulty taking time off 
work to come to a voice center for treatment. Telemedicine 
makes it possible to receive voice treatment in the privacy of 
one’s computer wherever it might be located. 


¢ Phonosurgery 


Phonosurgery is laryngeal surgery done solely, or primarily, 
to improve phonation. This requires some glottic adjust- 
ment, and when closure is improved, often gains are 
achieved not only in voice but also in breathing, vocal effort 
level, and swallowing. One way of understanding the myriad 
of phonosurgery techniques is to classify them in one of five 
categories: removal of pathologic tissue; correction of the 
position, shape, or tension of the vocal fold; restoration of 
neuromuscular function; reconstruction for partial loss of 
the larynx; and reconstruction for entire loss of the larynx. 
Reconstruction for partial or total loss of the larynx is outside 
the scope of this text. A brief overview of the other three cat- 
egories follows. 

Various procedures designed to remove pathologic tissue 
are commonly used in cases of polyps, cysts, nodules, and 
papillomas. This extirpation surgery can either be done with 
a laser or a cold knife. Special microsurgical instruments 
have been developed for this delicate surgery. Laser surgery 
is controlled destruction by thermal evaporation. The advan- 
tage of this technique is the precision and healing. The dis- 
advantage is possibility of error and unwanted destruction of 
tissue. 

As with all surgery, there is a possibility of encountering 
problems. Although poor outcomes are rare when surgery is 
done by a “phonosurgeon,” voice clinicians should have 
knowledge of the potential adverse affects. The patient may 
develop excessive edema and have airway problems that in 
the worst case cause death or require a tracheotomy. It is also 
possible that the patient will have an allergic reaction to the 
anesthesia or to a material being used for an implant. Any 
time there is tissue damage, there is also possibility of laryn- 
geal scarring. There is no consistently successful treatment 
for scarring. Thus, this iatrogenic problem is considered to be 
the most difficult of laryngeal problems to treat and one of 
the worst adverse consequences of laryngeal surgery. When 
scarring develops, the voice is often worse than before treat- 
ment. On occasion, the scarring may be so extensive as to 
render the patient aphonic. 

The literature is replete with phonosurgical procedures 
that come under the category of correction of position, 
shape, or tension. Most of the procedures are either consid- 


ered to be vocal fold augmentation, or cordal injection, or 
cartilaginous framework surgery. 

Laryngeal framework surgery was first described by 
Isshiki (1989a,b). Isshiki proposed remodeling the laryngeal 
framework to achieve better glottal closure and/or to 
improve pitch. He divided these “thyroplasty” procedures 
into groups based on whether the removal resulted in a 
change in position, shape, or tension: type I, lateral compres- 
sion to achieve median shift of the vocal folds; type II, lateral 
shift to expand distance; type III, shortening or relocation; 
and type IV, lengthening to increase tension and elevate 
pitch by cricothyroid cartilage approximation. 

The most common thyroplasty procedure is type I. This 
procedure is used to correct glottic incompetence when the 
vocal folds fail to come together. This procedure has been 
successfully used in patients with vocal fold paralysis, vocal 
fold bowing, sulcus vocalis, Parkinson’s disease, and in 
abductor spastic dysphonia. A small window is cut in the 
thyroid cartilage. A muscle, Gore-Tex (PTFE, poly-tetraj-fluoro 
ethylene; Gore Medical Products Division), fat, or some other 
alloplastic or biological material is placed in the window to 
push the vocal fold medially. The procedure is done under 
local anesthesia so that the surgeon can not only see how 
well the vocal folds are approximating but also can elicit 
vocalizations from the patient to see how well he or she is 
functioning. Windows are drilled in only one side for a uni- 
lateral procedure or both sides for a bilateral procedure. One 
of the beauties of this procedure is that the delicate layer 
structure of the vocal fold lamina propria does not get dis- 
rupted. Moreover, if in follow-up evaluation the procedure is 
deemed less than ideal, the thyroplasty can be revised. 

Cordal injections have a long history (Ford, 2004). Brun- 
ning (1911) described correcting vocal fold paralysis by 
injecting paraffin, bone paste, and a variety of other materi- 
als. Arnold (1963) proposed Teflon injection. Initially, Teflon 
appeared to be an ideal substance. Results were immediate 
and appeared to be effective. However, as with many such 
procedures, the immediate glow of success was overshad- 
owed by subsequent reports of long-term consequences of 
migrating Teflon particles, lack of control as to placement of 
bolus of Teflon, foreign-body reactions, and Teflon granulo- 
mas. Today, most clinics use Teflon only with terminally ill 
patients or patients in their eighth decade of life. 

The last category of phonosurgery to be discussed is laryn- 
geal reinnervation. The concept of replacing a paralyzed or 
injured nerve is not new. Nearly 100 years ago, Horsley 
reported repair of an injured recurrent laryngeal nerve (RLN). 
Subsequently, numerous surgeons have reported success 
with reinnervation of the RLN with a branch of the ansa cer- 
vicalis (Crumley, 1982, 1991; Crumley, Izdebski, and 
McMicken, 1988; Frazier, 1924; Goding, 2005; Paniello, 2001, 
2001; Smith, 2006). The objective of laryngeal reinnervation 
is to restore physiologic movement and sensation to a para- 
lyzed larynx. The most useful donor nerve in laryngeal paral- 
ysis treatment is the ansa cervicalis nerve (Smith, 2006). 
According to Smith, it matches favorably with RLN, has little 
to no dysfunction from transection of the branch to the ster- 
nohyoid or omohyoid muscles, can be obtained from either 
ipsilateral or contralateral sides, and can be connected to the 
larynx by direct neurorrhaphy, nerve-muscle-pedicle, or 
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direct muscle implant. Of these three techniques, the direct 
neurorrhaphy appears to be most effective and is thought to 
have “theoretical advantages with restoring viscoelastic sym- 
metry and not just geometric symmetry” (Smith, 2006), as in 
the case of medialization procedures using implants or 
injectables. However, clinicians need to recognize that the 
voice after the neurorrhaphy may be unchanged or worse for 
the first 3 or 4 months until reinnervation begins. The voice 
can then be expected to continue improving over a 12-month 
period. In instances when the patient requires immediate 
resolution of the voice disturbance, the procedure may be 
combined with a temporary injectable such as Gelfoam (Sur- 
gicel; Pfizer, Inc., New York, NY) or collagen. The neurorrha- 
phy generally improves adduction but does little for abduc- 
tion. Zealear and associates (2002) have suggested using 
electrical stimulation of the denervated muscle to promote 
selective reinnervation by native over foreign motoneurons. 
In repeated studies in canines and preliminary human clini- 
cal trials, they have demonstrated that paced muscle stim- 
ulation using an implantable commercial stimulator 
results in preferential repressed reconnection by foreign 
motoneurons, thereby promoting selective reinnervation. 


+ Management of Specific Voice Disorders 


The remainder of this chapter is devoted to therapy that has 
evolved for specific voice disorders based on the special 
requirements of each disorder. 


Musculoskeletal Tension (Vocal Hyperfunction) 


The majority of cases we shall find suffer from some 
excess of muscular function (hyperfunction) in various 
degrees. However, this refers not to the vocal folds them- 
selves but rather to the whole tensor mechanism of the vocal 
organ, which comprises all the muscles surrounding the 
cartilages of the larynx. We have generally not found it 
necessary—and often not possible—to decide which spe- 
cific muscle or muscles were at fault (Weiss, 1971). 

Life stress elevates general muscle tension, including that 
of the laryngeal muscles, and can produce an array of voice 
defects: aphonia, breathiness, hoarseness, or excessively 
high pitch. No single voice type is associated with muscu- 
loskeletal tension. Vocal hyperfunction is the most impor- 
tant concept in the etiology and treatment of nonorganic, 
and many organic, voice disorders. Intrinsic and extrinsic 
laryngeal muscle tension expresses anxiety, depression, or 
anger. In organic voice disorders, it may occur as a result of 
efforts to compensate for the organic deficit. All patients 
with voice disorders, regardless of etiology, should be tested 
for excess musculoskeletal tension, either as a primary or as 
a secondary cause of dysphonia. The degree of voice 
improvement after therapy for musculoskeletal tension is 
proportional to the reduction of musculoskeletal tension. 

Following are the principles basic to therapy for muscu- 
loskeletal tension associated with vocal hyperfunction. 


+ Extrinsic and intrinsic laryngeal muscle cramping is 
responsible for the abnormal voice. Reducing muscu- 
loskeletal tension releases the inherent capability of the 
larynx to produce normal voice. 


e When gently rubbed or kneaded, muscles relax and 
become less painful. 


+ Lowering laryngeal position in the neck permits more 
normal phonation. 


Testing for Muscular Tension 


The first procedure for implementation of these principles is to 
assay signs of musculoskeletal tension. Its presence is sus- 
pected when patients complain of spontaneous pain in the 
region of the larynx shooting up to the ears and down to the 
chest and a feeling of a lump, a ball, or tension in the larynx or 
pharynx. Testing for muscle tension is subdivided into deter- 
mining three things: (1) extent of laryngeal elevation, (2) pain 
in response to pressure in the region of the larynx, and (3) 
extent of voice improvement after tension reduction (Fig. 11.1): 


1. Encircle the hyoid bone with the thumb and middle fin- 
ger, working them posteriorly until the tips of the major 
horns are felt. 


2. Exert light pressure with the fingers in a circular motion 
over the tips of the hyoid bone and ask if the patient feels 
pain, not just pressure. It is important to watch facial 
expression for signs of discomfort or pain. 


3. Repeat this procedure with the fingers in the thyrohyoid 
space, beginning from the thyroid notch and working pos- 
teriorly. 


4. Find the posterior borders of the thyroid cartilage just 
medial to the sternocleidomastoid muscles and repeat 
the procedure. 


5. With the fingers over the superior borders of the thyroid 
cartilage, begin to work the larynx gently downward, also 
moving it laterally at times. One should check for a lower 
laryngeal position by estimating the increased size of the 
thyrohyoid space. 


6. Ask the patient to prolong vowels during these proce- 
dures, noting changes in quality or pitch. Clearer voice 
quality and lower pitch indicate relief of tension. Because 
these procedures are fatiguing, rest periods should be 
provided. 


7. Once a voice change has taken place, the patient should 
be allowed to experiment with the voice, repeating vow- 
els, words, and sentences. 


Rate of improvement varies. If the dysphonia has been due 
to musculoskeletal tension alone and has been present only 
a short time, the voice may become normal within minutes. 
Those who have been dysphonic longer may require several 
hours of therapy administered in separate sessions. The 
voice rarely changes from aphonia or dysphonia to normal 
without first passing through several dysphonic stages. The 
clinician should not hesitate to attempt normal voice within 
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Figure 11.1 (A) Loci of laryngeal and hyoid bone pain from digital pressure (shaded areas). (B) Maneuvering the larynx to a lower position in the neck. 


one session, for the majority of patients can be helped con- 
siderably, if not completely, within that time. Patients whose 
voices fail to improve despite the above ministrations may 
not be ready to relinquish the abnormal voice because of 
musculoskeletal tension secondary to conversion reaction. 
One should watch for the first signs of improvement. Recov- 
ery takes on its own momentum. A switch then should be 
made from repetitions of speech sounds and simple nasal 
sentences to spontaneous speech beginning with more auto- 
matic responses such as the patient’s name, address, and 
occupation. 

Once the normal voice has returned, most patients ask why 
it did and why they were not able to accomplish this by them- 
selves. The clinician explains the effects of muscle tension on 
voice in terms the patient can understand. Often, after voice 
improvement, the patient recalls unpleasant life situations 
giving rise to the voice disorder. A frank and open discussion 
of such situations and the extent of their seriousness is neces- 
sary to determine the need for more intensive psychologic 
investigation and therapy. Commonly, musculoskeletal ten- 
sion is not due to a specific problem but is the patient’s life- 
long way of reacting to stress. Such patients overdrive them- 
selves, and changing daily habits may serve to preclude future 
abnormal voice episodes. Realistically, the normal voice may 
be short-lived, and the dysphonia may return when the ten- 
sion returns. Not much is known about the long-term fate of 
such therapy or ways to prevent further occurrences. 

The literature on voice therapy is replete with advocacy of 
chewing therapy, progressive relaxation, and biofeedback for 


vocal hyperfunction as described earlier in this chapter. 
These therapies have the same objective as the method just 
described. However, musculoskeletal tension is a powerfully 
resistive force, and less aggressive methods often fail. Here, 
as elsewhere, the best approach is trial and error. 

Maneuvering the patient’s laryngeal and hyoid anatomy as 
just described is safe as long as it is performed with good 
judgment. This technique is not offered as an alternate ther- 
apy but as a primary one. It is hard work for both clinician 
and patient and would not be advocated were it not useful in 
achieving its aims. Patients will complain of soreness in the 
region of the larynx after such therapy, but they can be 
assured that it will disappear within a day or two. 

Finally, in addition to elevation of the larynx and hyoid 
bone, the tongue may be elevated, integral with the total 
pattern of extrinsic laryngeal muscle contraction. The 
tongue is carried high in the mouth, and the submental mus- 
culature is tense and painful. The tongue should be pressed 
down along with the hyoid bone and larynx, either with the 
finger or with a tongue depressor, while simultaneously lis- 
tening for voice improvement. 


Voice Abuse 
Vocal Fatigue 


Fatigue is defined as an inability to maintain a power output 
or force during repeated muscle contractions (Powers and 
Howley, 1990). In limbs, fatigue is often remedied by rest and 
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by exercise building up strength, endurance, and flexibility. A 
one-to-one relationship cannot be made between limb and 
laryngeal fatigue because laryngeal fatigue involves both 
muscle (body) and tissue (cover including epithelium and 
lamina propria). Consequently, to understand its treatment, 
it is necessary to look at the assumptions underlying vocal 
fatigue, and the physiologic limitations of the vocal appara- 
tus. According to contemporary researchers, it is assumed 
that vocal fatigue relates to the ability of the laryngeal appa- 
ratus to withstand various vocal loads. In other words, there 
is a limit to the mechanical stress that can be placed on the 
vocal system. But like all systems, there is a range of 
strength, and some instruments are inherently stronger than 
others. The ability to sustain a vocal load is, in part, related to 
the strength of the apparatus but also to the age, the use, and 
the conditions under which it is used and the material it is 
made from. Apparatus strength or lack thereof may be off- 
set by training that improves duration of withstanding a 
given load. For any given individual, the strength of the 
apparatus may vary and be more vulnerable, or at risk, dur- 
ing times of stress, specific medical conditions, after inges- 
tion of various pharmacological agents and recreational 
drugs that either dilate blood vessels (vasodilators) or dry 
out tissue (diuretics), or after periods of excessive use of the 
vocal instrument particularly when the external environ- 
ment is poor (dusty, noisy, or dry). This being the case, 
there are several occupational hazards because of vocal 
load requirements. Foremost among these hazards is teach- 
ing where teachers are required to talk with a loud voice for 
long periods of time often in less than ideal speaking envi- 
ronments: the rooms often have poor acoustics, are dusty, 
and contain noisy children. Thus, much of the recent 
research on workload and vocal fatigue has been accom- 
plished on teachers. 

In voice, prolonged phonation is one way to view the 
laryngeal apparatus’s ability to maintain a power output or 
force. Titze (1999) suggests there are five possible physio- 
logic limitations to prolonged phonation resulting in vocal 
fatigue: (1) fatigue of laryngeal muscles that provide tension 
and stability; (2) fatigue of respiratory muscles that provide 
subglottic pressure; (3) relaxation of passive tension (liga- 
ments, joints, and membranes); (4) increase in tissue viscos- 
ity resulting from increased friction from dehydration or 
chemical changes in fluid compensation; and (5) change in 
blood circulation where there may be either constricted 
blood vessels impeding regenerative processes and reduced 
capacity to transfer heat away from the vocal folds or vasodi- 
lation resulting in increased mass. 

The symptoms of vocal fatigue are innumerable. Among 
the most common symptoms reported by patients are voice 
gives out at the end of the day; increased vocal effort; change 
in vocal quality (increase in breathiness or glottal fry); 
decrease in vocal range, flexibility, or endurance; abnormal 
throat sensations such as hurting, burning, drying, or 
scratchiness; a tight chest or increased difficulty breathing; 
or increased throat clearing or coughing. Generally, these 
symptoms are reported as being worse at the end of the day 
or after vocalizing (singing, teaching, talking) for some 
period of time. Most notable is that the patient notices a 
change in vocal ability and effort. 


How then does one treat vocal fatigue, a condition that 
often changes during the course of the day and does not 
necessarily result from the presence of pathology but rather 
from too heavy a vocal load for the apparatus? Research 
completed in the past 5 years suggests that any treatment 
that reduces the load will in turn reduce vocal fatigue. This 
may include increased hydration, periods of vocal rest, exer- 
cises building endurance, and use of amplification. The 
effectiveness of these treatments has been documented in a 
series of studies by Roy and colleagues (2000, 2001a,b, 
2002, 2004), Smith and colleagues (1996, 1998a,b), Jonsdot- 
tir and colleagues (2001, 2002, 2003), Sapienza, Crandell, 
and Curtis (1999), Yiu and Chan (2003), and McCabe and 
Titze (2002). 

Yiu and Chan (2003) investigated the effect of hydration 
and vocal rest on vocal fatigue in 20 young amateur karaoke 
singers. Half of the singers were given water to drink and 
short duration of vocal rests at regular intervals during 
singing, and the other half sang continuously without taking 
any water or rest. The subjects who were given hydration 
and vocal rests sang significantly longer, and the subjects 
who sang continuously without drinking water and taking 
rests showed significant changes in jitter measure and the 
highest pitch they could produce during singing. The authors 
interpreted these results to suggest that hydration and vocal 
rests are useful strategies to preserve voice function and 
quality during karaoke singing. Whether similar results 
would be found in other singers suffering from vocal fatigue 
or found with vocal rest without hydration could not be 
determined from the study. Like amateur karaoke singers, 
preschool teachers have vocal heavy loading factors and lit- 
tle voice training. Sodersten, Granqvist, Hammarberg, et al 
(2002) suggest that one way to reduce the vocal loading for 
this occupation is to “include pauses so that preschool teach- 
ers can rest their voices.” Chan (1994) took a different 
approach with kindergarten teachers. He found that teachers 
who followed a vocal hygiene program for 2 months 
improved their vocal health by reducing vocal abuses and 
practicing healthy voice strategies. However, conflicting 
results were reported by Roy (2002). Roy and colleagues 
studied the effects of using vocal hygiene versus vocal exer- 
cises in 58 teachers with voice problems who were randomly 
placed in three groups (no treatment controls, vocal hygiene, 
and vocal function exercises). Only the teachers who were 
persistent with vocal function exercises showed improve- 
ment. However, although teachers in the vocal hygiene 
group did not improve, they did not get worse as did the no- 
treatment group. In another study conducted by Roy and col- 
leagues (2005), 64 teachers were randomly assigned to one 
of three treatment groups: resonant voice therapy, amplifi- 
cation use, or respiratory muscle training. Teachers in the 
amplification and resonant voice therapy showed improve- 
ment with those in the amplification group exhibiting the 
greatest change. Similar results about the benefit of amplifi- 
cation in reducing the vocal fold and reducing vocal fatigue 
have also been reported by Sapienza, Crandell, and Curtis 
(1999) and Jonsdottir (2001, 2002, 2003). Chant therapy also 
appears to be effective in remedying vocal fatigue (McCabe 
and Titze, 2002) by decreasing the vocal load through 
improving resonance balance and reducing adductory forces 
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Table 11.1 Clinical Clues to Distinguish LPR from Other Causes of Hoarseness 


Rhinosinusitis Benign Vocal Malignant Vocal 
LPR Infection (Postnasal Drip) Allergy Fold Lesion Fold Lesion 
Hoarseness Fluctuates Acute, resolves Acute/chronic Fluctuates Constant Progressive 
characteristic or recurrent 
Throat pain Common Yes Uncommon No From Late (local 
(with cough, secondary and referred) 
throat muscle 
clearing) tension 
Laryngeal Edema, Erythema, Secretions Edema, Nodules, Ulcerative 
findings granuloma, edema (thick, clear polyps, or exophytic 
erythema, discolored), secretions, cysts, (red-white 
pseudosulcus edema bluish mucosa scars mass), stiff 
Aggravating Smoking, Systemic LPR, allergy, Environment, Smoking, Smoking 
factors obesity, infection, smoking seasonal vocal (common), 
diet/lifestyle immunosuppression trauma, LPR LPR, ethanolism 


LPR, Laryngopharyngeal reflux. 


of the vocal fold. It is not surprising to learn that reportedly 
chant therapy, Accent technique, and resonant therapy all 
have positive results in reducing vocal fatigue. These treat- 
ment techniques promote relaxation, easy phonation, and 
resonant balance and in so doing reduce the vocal load. 
Amplification reduces the load by reducing the vocal inten- 
sity of the speaker. 

Based on existing assumptions and research completed 
primarily on teacher populations, it appears that a treatment 
program successful in reducing vocal fatigue and improving 
vocal quality focuses on reducing vocal load by providing 
the patient with (1) education as to good vocal hygiene, 
(2) amplification for speaking in noisy environments, (3) 
interspersed periods of vocal rest, and (4) treatment tech- 
niques that improve vocal resonance, relaxation, vocal flexi- 
bility, and endurance (vocal function exercises). 


Reflux 


Esophageal reflux is included under vocal abuse because it 
irritates, or abuses, the laryngeal tissues and creates a condi- 
tion ripe for development of laryngeal pathology. The Com- 
mittee on Speech, Voice and Swallowing Disorders of the 
American Academy of Otolaryngology-Head and Neck 
Surgery’s position statement of laryngopharyngeal reflux 
(LPR) states that LPR refers to the backflow of stomach con- 
tents into the throat and differs in mechanism and manifesta- 
tions from classic gastroesophageal reflux disease (GERD) 
(Koufman, Aviv, Casiano, et al, 2002). The primary problem in 
LPR is upper-esophageal sphincter dysfunction, which can 
occur at any age. Koufman and her colleagues (1985, 1991) 
have demonstrated the detrimental effects of reflux and they 
have also demonstrated experimentally that as little as three 
LPR episodes a week can result in severe laryngeal damage 
because the larynx lacks the epithelial defenses present in 
the esophagus. The most common symptoms of LPR are 
hoarseness, globus pharyngeus, dysphagia, cough, chronic 
throat clearing, vocal fatigue, and sore throat (Ford, 2006; 
Koufman, Amin, and Panetti, 2002). It has been estimated by 
Koufman, Amin, and Panetti (2002) that up to half of patients 


with laryngeal and voice disorders have reflux of varying 
degrees. The severity of the disease determines the treat- 
ment. Treatment often includes recommendations for dietary 
and lifestyle changes to reduce the factors thought to precip- 
itate reflux (e.g., smoking, drinking alcohol, ingesting coffee, 
eating spicy foods, eating just before bed, sucking on mints, 
wearing tight belts, bending at waist level, and sleeping with- 
out elevation of the head of the bed). In treating LPR, behav- 
ioral modification alone often is insufficient, and medical 
management of acid suppression and esophageal motility is 
also indicated. Ford (2005) developed a schema to provide 
clinicians with the clinical clues necessary to distinguish LPR 
from other causes of hoarseness (Table 11.1). He underscores 
the importance of case history taking and suggests that when 
the history is suggestive of diagnosis, patients be put on a 3- 
month trial of behavioral change and gastric acid suppression 
by adequate doses of proton pump inhibitor (PPI) medication. 
Patients failing this trial should be referred for multichannel 
intraluminal impedance and pH-monitoring studies to con- 
firm LPR and assess the magnitude of the problem. Koufman, 
Amin, and Panetti (2002) state, “For most patients with LPR, 
twice-daily dosing with a PPI is recommended for an initial 
treatment for a period of no less than 6 months, and lifetime 
treatment may be required” (Fig. 11.2). 


Vocal Nodules in Children 


In children and adults, vocal abuse produces general inflam- 
mation or focal lesions that result in breathiness, huskiness, 
or hoarseness. It may occur secondary to LPR, coincidently 
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Figure 11.2 Algorithm for assessment and management of LPR from 
Ford. 
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with LPR, or alone. Musculoskeletal tension is often the 
physiologic basis of the abuse, and emotional and personal- 
ity factors are its driving force. Vocal abuse in children takes 
the forms of loud talking, shouting, screaming, coughing, or 
throat clearing. The purposes of therapy are to reduce vocal 
fold trauma through reeducation and to reduce or eliminate 
emotional and environmental stress. 

The therapy of choice in children and adults who have 
vocal nodules is vocal reeducation rather than surgical 
removal of the nodules, which may only return if vocal abuse 
is not reduced. Surgery without treating vocal abuse is now 
recognized as ineffective, or worse, in the treatment of vocal 
nodules except in cases of chronic, advanced, mature, fibrous 
nodules or when a microweb is present. Patients who have 
not had speech therapy show a higher recurrence rate 
(Barnes, 1981). 

Von Leden (1988) has written that the presence of vocal 
nodules is not an automatic indication for surgery, particu- 
larly in patients who use their voices professionally, and if 
nodules do not respond to conservative treatment and if sur- 
gery is necessary, all efforts must be made to preserve 
healthy mucous membrane covering the vocal folds. “Strip- 
ping” has no place in the removal of vocal nodules because 
this procedure includes removal of healthy mucous mem- 
brane, denuding the vocal folds likely causing scarring, fibro- 
sis, restricted vibratory excursions, and permanent dysphonia. 
The vocal folds may appear normal on laryngoscopic exami- 
nation even with magnification; however, stroboscopic 
examination has shown limitations in the vibratory motions 
of the vocal folds that are typical in patients who have fibrotic 
changes. This same precaution to preserve the mucous cover- 
ing of the vocal folds applies to other benign lesions. 

In a survey of 535 otolaryngologists, Moran and Pentz 
(1987) found that 59% expressed preference for voice ther- 
apy as the sole mode of treatment for vocal nodules in chil- 
dren; less than 1% preferred surgery as the sole treatment; 
9.4% recommended surgery to be followed by voice therapy; 
50% indicated that voice therapy is frequently or always 
effective in the treatment of vocal nodules in children. 

Voice therapy for children who have vocal nodules can be 
conducted either individually or in small groups. Group ther- 
apy has the advantage of heightening competitiveness and, 
consequently, motivation. 


Informational Phase. The child should be given simplified 
information on the structure and function of the larynx. 
Drawings, color photographs, slides, models, and motion 
pictures of the larynx can be used for this purpose. One 
should explain why the voice is defective by comparing 
photographs or drawings of the normal vocal folds with 
those of vocal nodules or general inflammation. 


Auditory Discrimination Phase. Using audio or video record- 
ings, one can compare children’s normal voices with 
those who have dysphonia as a result of vocal abuse and 
can discuss in detail how the voices differ. In group ther- 
apy, the children can evaluate one another’s voices com- 
pared with the normal. Children can use severity rating 
scales or other means of documenting their reactions. 


Voice-Production Phase. By now, the child should know the 
reason for the voice disorder and its characteristics. Two 


considerations in the voice-production phase are the 
following: 


1. Situations in which vocal abuse is most likely to 
occur. The child needs to be made aware of situations 
in which vocal abuse is likely to occur, for example, 
yelling during sports activities and other forms of 
play, airing of conflicts among friends and family, and 
uncontrolled coughing and throat clearing. 


2. Training the child to reduce vocal loudness and glot- 
tal attacks is central to direct voice therapy. 


+ Reduce vocal loudness The child’s loudness dis- 
crimination abilities can be increased by recording 
voice at various loudness levels or by using various 
feedback devices. One should identify a comfort- 
able loudness level and get the child to agree not to 
exceed this level in outside situations. The child 
should be told to use alternate signaling devices 
instead of yelling or be taught to yell by using more 
abdominal support and less laryngeal constriction. 


¢ Reduce sharpness of glottal attack The child 
should be taught to produce words beginning with 
vowels by using a gentle vocal attack. The differ- 
ence between the sharp or abrupt coup de glotte 
and the easy onset of voicing should be demon- 
strated. This technique can be illustrated by asking 
for phonation during sighing, leading into vowels 
produced with a breathy attack (i.e., beginning 
with the /h/ phoneme). The child should be made 
kinesthetically aware of relaxed, easy exhalation 
along the entire respiratory-laryngeal tract. One 
should teach the child to be aware of muscle sen- 
sation as well as the sound of the voice while 
developing new habits of phonation. 


¢ Discuss personal and family problems As in all voice 
disorders, one should try to eliminate underlying 
causes or aggravating factors. Clinical research has 
shown that children abuse their voices in the 
process of acting out their aggressions, thereby dis- 
charging emotional tensions. Although many chil- 
dren are insufficiently aware of the emotional 
aspects of their lives, others have considerable 
insight and can talk about unpleasant situations, 
feelings, and attitudes toward friends and family. 
Time should be given for discussion of anxiety, frus- 
tration, anger, and unpleasant experiences. Group 
therapy is an excellent vehicle for generating discus- 
sion among children about their feelings and atti- 
tudes. Such discussions can be conducted maturely 
and without guilt about disclosing personal and 
family problems. To do so is valuable for the clinician 
to judge the extent to which the vocal abuse is a 
component of a larger emotional problem. Discus- 
sion with parents, school psychologists, and other 
professionals in a position to help may be necessary. 


Parent Counseling Phase. Parents are an important part of 
the therapy for vocal abuse. They should be informed as 
to why voice therapy is being given their child and the 
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alternatives to behavioral management. They should be 
aware of the potential role of reflux as a causative factor 
and that it is not uncommon in children. They should know 
the rationale and the methods being used and, especially, 
what is expected from the child in the way of self-control. 
They also should be aware that therapy is generally short- 
term and if it fails to yield significant changes that surgery 
may be indicated because of cysts or microwebs. Parental 
interviews are superior to communicating by letter or 
telephone. Parents differ in their attitudes toward voice 
therapy for their children. Many are not aware of the voice 
disorder—to them the child always sounded that way; 
that’s just “his” voice. Others may not be convinced of the 
need for a change and resent that their child has been sin- 
gled out for voice therapy. It is useful, therefore, to clarify 
objectives and to allow parents to express their attitudes. 


Notwithstanding the time-honored concept of vocal nod- 
ule as a simple vocal abuse disorder, research studies have 
now established unequivocally that children who develop 
vocal nodules often have personality traits that make inhibi- 
tion difficult and are often in conflict with family members 
and peers at school. The speech-language pathologist must 
weigh the factor of vocal acting out because of personality 
and family conflicts as a cause of the vocal abuse, investigate 
psychosocial forces at home and at school, and consider 
appropriate psychologic referral if indicated. One should dis- 
cuss with parents, as with the child, sources of personal or 
family conflict that might be producing anger and emotional 
distress leading to vocal abuse. 

Parents want to know whether or not they should remind 
the child at home about not abusing the voice. There is no 
firm rule except that the child is responsible for the voice, 
and the burden of that responsibility needs to be placed pri- 
marily on the child. If reminders are necessary, they should 
be infrequent and should never take the form of nagging. As 
reminder devices, the child can be given a calendar or other 
charting device on which to record gradations of voice use 
and abuse during each day. The child rates the voice accord- 
ing to how much and how loudly it was used. Although such 
devices are useful as reminders, they should be kept simple. 
Rewards for progress should be given. 


Vocal Nodules in Adults 


Vocal nodules in adults produce dysphonia similar to that 
found in children. It is husky, breathy, hoarse, and of lower 
than average pitch. Because the cause of vocal nodules in 
adults is basically the same as in children, so are the princi- 
ples of therapy. Adults need to be informed of the reasons for 
the nodules. Graphic illustrations from strobe or other laryn- 
geal exams and technical explanations should be tailored to 
the patient’s background. Advice to reduce abuse of the voice 
during speaking and singing is usually accepted and imple- 
mented by most adults. Singers and professional voice users, 
as described in a separate chapter, are special cases particu- 
larly when the presence of vocal nodules gives the singer the 
vocal quality with which he or she is identified. Reduction of 
the nodule changes the vocal characteristics and could ruin 
his or her career. When singing is suspected as a major or 
contributory cause, a singing teacher or vocal coach should 


be consulted. An assessment needs to be made as to how the 
voice ought to be used during singing; for example, loud- 
ness, background accompaniment (which may be excessive), 
and appropriate pitch range. The speech-language patholo- 
gist, patient, and voice coach all need to work together to 
determine the best singing practices to avoid vocal problems 
and aggravation of the nodules. 

In all patients, health problems such as LPR and emotional 
problems paralleling the development of the nodules should 
be investigated. Many patients develop nodules even though 
they have not changed their singing or speaking habits. 
What often emerges during a thorough voice evaluation is 
that they have experienced an upper respiratory infection or 
LPR that resulted in irritated tissue and difficulty in produc- 
ing voice. In many instances, they experienced an emotional 
upset in their lives that became the tipping point resulting in 
increased musculoskeletal tension to the point of adding 
enough frictional trauma to the vocal folds to produce nod- 
ules or they experienced the two in concert. The emotional 
problems may be minor, transient stresses from situational 
problems at work or at home—or more severe chronic 
stresses that seriously interfere with the person’s sense of 
well-being. The speech-language pathologist needs to deter- 
mine how serious these background factors are, and if the 
patient cannot manage them, she might need to be referred 
for psychologic or psychiatric consultation. 

Adult females with vocal nodules are characteristically 
extroverts who are active, talkative people. They openly, 
smilingly, and with a tincture of pride admit they have always 
liked to talk. Within their ability to do so, they do readily 
accept advice to limit the amount of their speaking. However, 
they often have occupations that are not conducive to reduc- 
tion in vocal output. As in children, adults should be taught to 
produce voice with less forceful stress or emphasis patterns, 
which often means lowering vocal intensity, and with easy 
onset of phonation rather than with a hard glottal attack. 
Some speech clinicians advocate reducing the pitch level of 
the voice to an “optimum pitch”; however, the necessity for 
finding an optimum pitch for reduction of nodules is contro- 
versial, as previously discussed in this chapter. Although the 
pitch is perceived as lower, quantitative measurements of fun- 
damental frequency suggest that the fundamental frequency 
(Fo) is generally within normal limits. The added mass and 
abnormal hourglass closure pattern results in a characteristic 
quality that sounds lower and is only changed by reduction in 
the size of the nodules. The clinician should help the patient 
find a comfortable pitch level. The “um-hum” method 
described previously can be used for this purpose. 

If vocal abuse has either caused or maintained the nodule, 
as is often the case, the voice modifications already men- 
tioned should reduce the size of the nodule and even elimi- 
nate it within several weeks, depending upon its size and 
severity, the patient’s ability to comply with therapy, the 
absence of a laryngeal web, and the size of the larynx. Except 
for encapsulated nodules, which many laryngologists believe 
should be removed surgically, voice reeducation is the ther- 
apy of choice and should be attempted for 6 weeks before 
surgery is considered. Patients should be seen periodically on 
return visits for both laryngoscopic and voice reevaluations 
to determine whether the effects of therapy are maintained. 
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One should not hesitate to involve family members in dis- 
cussions with the patient about habits of voice use and 
implementation of therapy. Parents, spouses, siblings, and 
even children can provide objectivity for the adult and can 
help effect the clinician's recommendations by understand- 
ing the objectives of such therapy and feeling like a part of 
the therapy process. 

Electronic devices for measuring vocal load have been 
used successfully in the treatment of vocal abuse disorders 
and should be considered as an important adjunct. 


Contact Ulcer 


Contact ulcers are so called because the granulation tissue 
occurs on the vocal process of the arytenoids where they 
make the greatest contact during loud phonation, coughing, 
and throat clearing. Contact ulcers are thought to result from 
LPR, abuse, and intubation trauma and result in hoarseness 
and excessively low pitch. They are rarely seen in females 
probably because during phonation, females have a poste- 
rior glottic gap and do not totally adduct the arytenoids. 
Therapy aims at reducing LPR and vocal fold trauma through 
vocal reeducation and eliminating emotional and environ- 
mental stress. Prognosis is excellent if the patient is able to 
reduce reflux and produce voice consistently in the manner 
prescribed. In cases of large granuloma formation, surgical 
excision is often necessary. Unless reflux has been managed, 
however, surgical removal of a contact ulcer is often unsuc- 
cessful because the granulation recurs. Clinical evidence 
shows that healing is best achieved with antireflux regimens 
and vocal reeducation. Voice therapy begins with an infor- 
mational phase, during which the patient is given an under- 
standing of the mechanics by which the ulcer has formed, 
using graphic illustrations. During the auditory discrimina- 
tion phase, comparisons between the abnormally low- 
pitched, hoarse dysphonia and normal voice are made. The 
vocal production phase concerns modification of voice use in 
daily life. After a thorough analysis of daily dietary habits 
and voice use and abuse, the patient needs to be informed 
that until the ulcer heals, the quantity and intensity of voice 
use has to be modified. Aronson (1990) suggests voice use be 
reduced ~50% for 1 month, eliminating all but necessary 
speaking. Silence or whispering is contraindicated because 
of patient tendency to use stage whispering. Patients are 
instructed to avoid shouting and communicating over long 
distances or in high ambient noise levels. Patients can return 
to normal voice use afterward provided that they follow the 
additional instructions detailed next: 


Anatomical Predisposition Patients need to recognize 
that they may have an anatomic predisposition to develop 
contact ulcers; tissue covering the arytenoids may be thin, 
tissue may be dry increasing friction during closure, and 
refractory reflux may irritate the tissue covering the vocal 
processes. These individuals must be extra cautious how 
they use the larynx for voice production because without 
careful use, they are likely to have recurrences. 


Produce Words Beginning with Vowels Because words 
beginning with vowels may be produced with an explosive 


glottal attack, which augments the force of vocal fold adduc- 
tion and the frictional trauma to the vocal folds, the patient 
must be trained to produce words beginning with vowels by 
means of an easy or aspirate vocal attack. The patient should 
be taught to recognize and to feel the difference between 
hard and soft glottal attacks. Explosive speech ordinarily 
used for emphasis needs to be modulated. 


Appropriate Pitch Level Because trauma may result from 
excessively low pitch, employed because the patient believes 
it is more authoritative or impressive in the classroom, 
courtroom, or salesroom, a more appropriate pitch level will 
need to be taught. The “um-hum” method is advocated for 
obtaining the most comfortable pitch. 


Reduce Coughing Because frequent coughing or throat 
clearing not only is damaging to vocal folds but also retards 
healing of the contact ulcer, the patient needs to reduce 
these activities. Patients can be taught to swallow hard, sip 
ice water, or massage the neck when they feel the need to 
bring up material, or to clear the throat or cough with an eas- 
ier, More aspirate approach rather than with an explosive 
glottal attack (i.e., the “silent cough”). 


Alcohol and Smoking Because trauma is aggravated by 
alcohol consumption and smoking, the patient needs to be 
informed that both substances irritate the vocal folds. For 
the initial period of 1 month, both drinking and smoking 
need to be eliminated completely. 


Reduce Musculoskeletal Tension Because patients with 
vocal abuse disorders have increased musculoskeletal ten- 
sion, the speech-language pathologist needs to work on 
reduction of those tensions (see the Musculoskeletal Tension 
[Vocal Hyperfunction] section of this chapter). 


Contact Ulcer Patients Contact ulcer patients who are 
hard driving, competitive, angry, and exhibit aggressive 
traits may be modifiable if the patient is concerned enough 
about their effects on the voice. If such tendencies are mild, 
they can be dealt with, but in more severe cases, referral for 
professional psychotherapy needs to be considered. Vocal 
abuse often occurs in fundamentally well-balanced people 
who have allowed themselves to be subjected to tremendous 
environmental stresses, such as an intensive schedule of 
speaking under adverse conditions at work, and burdensome 
extracurricular activities. There needs to be discussion with 
the patient about refusing to participate in such a vocally 
demanding schedule. Heavy work schedules that place the 
patient under extreme pressures resulting in irritability and 
leading to anger are precursors of the vocal abuse that leads 
to contact ulcer. The modification of the psychologic envi- 
ronment of some contact ulcer patients may be as important 
as reflux management and direct voice therapy, and, in fact, 
one without the other may prove ineffectual. 


Conversion Voice Disorders 


Conversion reactions produce diverse voice signs: muteness, 
aphonia, intermittent aphonia, breathiness, hoarseness, 
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excessively high pitch, and falsetto. The abnormal voice, a 
sign of emotional or interpersonal conflict, has served or 
continues to serve the purpose of enabling the patient to 
avoid confrontation with an unpleasant life situation. Also, 
the voice may be serving the purpose of primary or second- 
ary gain. Conversion aphonia or dysphonia may continue 
even after conflict resolution. 

The voice disorder exists on an unconscious level; the 
patient is not willfully or volitionally producing the abnor- 
mal voice and consciously believes that the disorder is 
organic, not psychogenic. Even though the neuromuscular 
system is intact, the patient is unable to phonate voluntarily 
and is convinced of this inability. The interpersonal prob- 
lems causing the voice disorder remain unexpressed to the 
parties concerned, including the clinician from whom the 
patient has sought help. The likelihood of complete return of 
the voice via symptomatic voice therapy is excellent if the 
patient is ready to relinquish the voice sign and to bring the 
problem causing it into the open. The prognosis is also excel- 
lent if the original problem has disappeared. When the con- 
flict has not been resolved, the prognosis is poor for patients 
who are unwilling to face and deal with the underlying prob- 
lem causing the voice disorder, in which case either it will be 
difficult—if not impossible—to improve the voice by means 
of any therapy or, if improvement does take place, it may be 
incomplete or short-lived. 

Symptomatic voice therapy is subdivided into the follow- 
ing phases: patient education; symptom removal by means 
of direct voice therapy, manual musculoskeletal tension 
reduction, catharsis; and psychotherapy when indicated. 
The efficacy of symptomatic voice therapy is predicated on 
the following principles: 


¢ Reduction of laryngeal musculoskeletal tension will 
change laryngeal posture and facilitate voice. 


¢ Evoking voice by any one of a variety of means will con- 
vince the patient of his or her phonatory capability. 


Therapy Rationale 


Successful therapy for conversion voice disorders begins 
with the speech-language pathologist taking primary 
responsibility for patient management. The history of our 
profession's attitudes toward the treatment of patients with 
conversion voice disorders has been one of avoidance of this 
responsibility. It is often reasoned, erroneously, that because 
the disorder is primarily psychogenic, its treatment should 
be left in the hands of the psychiatrist or clinical psycholo- 
gist. Another time-honored belief held by clinicians of all 
specialties is that symptom removal of a conversion disorder 
is dangerous if its underlying cause is not treated and that 
the conversion sign will reappear someplace else. This idea 
dies hard despite the fact that psychiatrists no longer sub- 
scribe to it. Hundreds of cases of conversion symptom 
removal in many different body parts without adverse 
effects or subsequent conversions have been documented. 
The worst result will be failure to relieve the voice sign 
because of patient unreadiness, undoubtedly a continued 
defense mechanism. Psychiatrists now believe that reported 


adverse reactions to attempted symptom removal were 
caused not by the removal of the conversion itself but by the 
precipitous and traumatic manner by which it was removed. 
Experience with conversion voice disorders shows that pre- 
mature referral of the patient to the psychiatrist or psycholo- 
gist is almost a guarantee of failure to improve the voice. The 
validity of this statement is based on the following reasons: 


The voice disorder, a somatic sign, is, in the patient’s 
mind, dissociated from conscious awareness of any emo- 
tional problem. This is the reason why most patients 
whose voices have been treated through traditional 
“interview” psychotherapy alone have not improved. 


+ Symptom removal in conversion voice disorders requires 
laryngeal muscular alterations simultaneous with audi- 
tory feedback (i.e., treatment directed at the larynx itself). 


+ Despite what might be excellent training in psychodiag- 
nostics and psychotherapy, most psychologists and psy- 
chiatrists are not trained to deal with voice disorders. 


+ The unavailability of psychiatric and psychologic person- 
nel and their lack of interest in voice disorders because of 
the relative mildness of the psychologic problems that 
underlie most of them make conversion patients less 
acceptable to such specialists. The psychiatrist and psy- 
chologist become most important after the voice disorder 
has been eliminated or improved. Then, if the emotional 
factors are sufficiently severe, referral is warranted. 


+ The patient with a conversion voice disorder who is con- 
vinced that the disorder is organic more often than not 
refuses referral to a psychologist or psychiatrist, in the 
beginning. Only after the voice has returned do such 
patients become aware of psychologic causes; they are 
then more receptive to referral. 


There is, then, a proper sequence of clinical events that 
needs to take place in the treatment of conversion voice dis- 
orders: laryngologic clearance of organic laryngeal disease 
and systemic illness; symptomatic voice therapy coupled 
with ventilation and elaboration of emotional problems, if 
the patient is capable of doing so; and referral to the psy- 
chologist or psychiatrist to achieve a more enduring stabi- 
lization of the patient’s life adjustment. A mature, cautious, 
and well-trained speech-language pathologist will not harm 
the patient. On the contrary, the opportunity given the 
patient to discuss emotional problems simultaneously while 
undergoing the more mechanical aspects of voice therapy 
can only serve to provide that patient with new hope for a 
better voice. It is for patients such as these that we advocate 
so strongly the interdisciplinary practice of speech-language 
pathology in settings that bring psychologist, psychiatrist, 
laryngologist, and speech-language pathologist together, for 
such relationships provide a feeling of greater confidence on 
the part of all clinicians treating such patients. 


Patient Education 


As in other voice disorders, the first phase of therapy for 
conversion voice disorders is patient education. After 
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obtaining a complete history and after deciding that a con- 
version voice disorder exists, if the voice has not improved 
spontaneously during the history—as it sometimes does as 
the patient reveals underlying emotional stresses for the 
first time—the first necessity in therapy is to educate the 
patient about the cause of the aphonia or dysphonia. Almost 
universally characteristic is their initial insistence that the 
disorder is organic. This denial will delay, in many cases, and 
prevent, in others, successful voice therapy. Time needs to 
be taken to discuss the beliefs of both laryngologist and 
speech-language pathologist that the disorder is not due to 
infection, tumor, or other organic cause and that it is due to 
excess laryngeal tension. This may be due to emotional 
stress and/or a habituated pattern of muscle use. How the 
speech-language pathologist handles this informational 
phase can determine the difference between continuation 
on to success or early failure. The difference lies in the man- 
ner by which this information is communicated. Many 
patients previously seen by other clinicians have been told 
point blank that they needed to see a psychiatrist. Such pre- 
cipitous revelation of what may be essentially true is almost 
always indigestible by the patient. On the other hand, a 
gradual, sympathetic unfolding of this fact through discus- 
sion can make the difference between acceptance and rejec- 
tion of the idea. 

The first step is to divest the patient of the notion that the 
voice disorder is organic by reviewing the laryngologist’s 
findings with the patient. Firmly and authoritatively, the cli- 
nician should tell, or preferably read aloud, the laryngolo- 
gist’s report-that the larynx was normal on examination, 
that no lesions or paralyses were noted, and that the patient 
has the physical capability to produce normal voice. This 
information not only serves to start the patient thinking 
about emotional factors but also relieves anxiety about 
threatening laryngeal disease, namely, cancer. Patients show 
visible signs of relief upon hearing this report. Their anxi- 
eties about laryngeal disease are real, and with the restate- 
ment about the normal laryngologic examination, the 
patient’s mind is cleared of fears of serious illness. The germ 
of the idea that perhaps the voice is due to emotional causes 
after all is thus planted. For some patients, this is enough for 
the moment to begin direct symptom management before 
dealing more directly with the emotional issues. When the 
clinician presents the concept, in language appropriate to 
the patient’s educational background, the clinician states 
that the larynx is a common site for the expression of emo- 
tional problems. Most people are already aware of this fact 
on the basis of lifelong experience. It can be pointed out to 
them how, in everyday life, we express our emotions—anger, 
fear, grief—through the larynx, and how it sometimes under- 
goes violent, uncontrollable muscular reactions that can be 
felt as well as heard during transient emotional states. The 
rate of informational exchange, the pacing, and the clini- 
cian’s finesse develop with practice. It is wise to stop at vari- 
ous points along the way to sample the patient’s reactions to 
this new knowledge. It is always interesting to see how dif- 
ferent patients with conversion voice disorders react to the 
idea of psychogenicity. A few are so threatened and incensed 
by the idea that they refuse to continue the interview, grab- 
bing for their coats and purses in preparation for terminating 


the session. Fortunately, although most patients react at first 
with skepticism, they are willing to listen. Some amiably 
agree. They “knew it all the time” but needed someone else 
to confront them with the facts. Once the patient is willing to 
consider the possibility that the voice disorder is related to 
emotional conflict or stress, the stage is set for the next 
phase. 

If the patient has not already gained voice, the clinician 
proceeds to inform the patient that there has been a loss of 
contact between the desire to produce voice and the volun- 
tary muscular ability to do so. After working with several 
patients who have conversion voice disorders, the clinician 
will eventually become convinced that the patient is gen- 
uinely under the impression that they are unable to phonate 
more than they do at the time of examination, even though 
they have been making audible sounds via throat clearing 
and coughing, thus failing to appreciate the relationship 
between those signals and the potential for normal voice 
(see Chapter 9). 


Direct Voice Therapy 


During direct voice therapy, all attention is paid to the larynx 
and voice, and psychologic considerations are set aside. One 
should test first for the presence and extent of muscu- 
loskeletal tension, which almost invariably accompanies 
conversion voice disorders. The identical procedures for the 
relief of isolated musculoskeletal tension are employed at 
this time. Incorporated into the manual relief of such tension 
are the clinician’s instructions to the patient to produce var- 
ious laryngeal noises—coughing followed by an extension of 
coughing into continuous voicing, throat clearing, and hum- 
ming. The purpose of this activity should not be lost on 
either clinician or patient: it is to convince the patient that 
there is more voice capability than the patient realized. 
When increased laryngeal sound occurs, it should be pointed 
out instantaneously. 

Patients vary in their rate of voice improvement and in the 
acoustic characteristics that accompany such improvement. 
Those who have had aphonia or dysphonia for long periods 
of time will go through several stages of dysphonia, which 
may require as much as an hour of therapy before normal 
voice is approached. These dysphonic transitional stages can 
take the form of raucous hoarseness or harshness. They must 
be recognized as transitional stages and must be worked 
through to normal voice. Patients react early to voice 
improvements with much surprise, even astonishment, 
proving their obliviousness to their actual voice capabilities. 
Their voice improvements should be met with the clinician’s 
approval and pleasure. Patients who have had conversion 
voice disorders for shorter periods will often shift from 
severe dysphonia or aphonia to normal voice within min- 
utes. Others who are not ready for symptom removal either 
will fail to respond with any voice improvement or will pro- 
duce varying degrees of voice improvement short of normal 
and then plateau despite continued efforts at therapy. Such 
patients are not candidates for further therapy at that 
point and ought to be asked to return for a second or third 
try. Some will fail to keep future appointments, for obvious 
reasons. 
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Catharsis and Counseling 


The next phase of therapy is catharsis and discussion of emo- 
tional causes. Upon experiencing a return of their voices, 
most patients will express emotional relief, usually a mix- 
ture of smiling and weeping, which they attribute to their 
happiness for having been relieved of the burden of the 
abnormal voice. Such emotional reaction is healthy and is to 
be encouraged, despite the patient’s protestations of embar- 
rassment. One should discuss with them their feelings about 
the return of the voice while at the same time encouraging 
them to use the voice to its fullest. This is a good time to get 
the patient to talk about the frustration, embarrassment, and 
other effects of the voice disorder. Most will describe consid- 
erable inconvenience that they and their families have suf- 
fered as a consequence of the disorder. The patient’s change 
of facial expression and general attitude should be noted. 
The face is less tense, less frowning; there are fewer signs of 
distress around the mouth and eyes, and a general impres- 
sion of happiness. The patient’s attitude will change to one of 
increased openness, a willingness to talk more freely, and, 
most importantly, an apparent improvement of insight and 
memory for the events, feelings, and attitudes that gave rise 
to the voice disorder in the first place. 

For patients who are receptive to discussion of psychogenic 
causes, despite earlier denial of them, this is an excellent time 
to reopen the subject. It is a never-ending source of wonder 
how often patients become aware of emotional conflicts only 
after the voice has returned and how they go on to describe 
acute or chronic stresses and conflicts in their lives, usually of 
an interpersonal nature and having to do with a breakdown 
in communication with persons close to them and in whom 
they have much ego investment. The speech-language 
pathologist will discover that quite ordinary irritations are, to 
many patients with psychogenic voice disorders, monumen- 
tal. One quickly learns that each person responds to life’s 
problems with different degrees of emotional reactivity. Here 
are some examples: loss of voice after an argument with a 
relative or friend; loss of voice because of feelings that one’s 
position of authority has been usurped in an office setting; a 
feeling of having been wronged in the settlement of a family 
estate; inability to tolerate personal habits in one’s spouse 
and to confront that person with one’s feelings; deep frustra- 
tion, disappointment, or job dislike that one is unable to 
change for fear of repercussions from employers or family; an 
engagement or marital relationship that has gone sour in 
which the patient is unable to express the wish that the rela- 
tionship be discontinued. A common denominator runs 
through these themes despite the differences in their 
specifics: the need to express anger, dissatisfaction, or grief, 
but an inability to do so. Lifelong, these individuals have 
never been able to express their feelings openly and directly. 
Often, the speech-language pathologist is the first person to 
whom they have had an opportunity to reveal these unspo- 
ken and sometimes unspeakable ideas. 


Referral 


Finally, the speech-language pathologist and patient need to 
talk about the future. They need to decide whether the orig- 


inal emotional problem is still operative, whether it has been 
resolved or is resolving. With clinical experience, a certain 
practical sense develops in judging whether or not the 
patient can benefit from further professional psychotherapy. 
The decision for referral to a psychologist or a psychiatrist 
should be bilateral. The patient, however, must make the 
final decision. Does he or she feel happy or satisfied with 
life? Is this someone who has long felt the need for profes- 
sional help but has been too shy, afraid, indecisive, or con- 
fused as to how to obtain it? Realistically, not all patients 
with conversion voice disorders are candidates for psy- 
chotherapy. Many are superficial and denying; they either 
reject such therapy on the grounds of not needing it or, once 
begun, discontinue. Sometimes, all one can do is carry the 
person over the crisis by being a listener and a friend. 
Although speech clinicians do not practice psychotherapy, 
we have a responsibility for being the “advanced party,” for 
breaking new ground, and for helping the patient to discover, 
perhaps for the first time, that further help is needed. An 
impressive number of patients, upon learning that their 
abnormal voice was a primitive substitute for a more mature 
solution, will then attack their problems directly. Often, 
patients will obtain enough insight from voice therapy and 
discussion alone to make effective changes in their lives, 
usually those who have sufficient ego reserve and aggres- 
siveness. The passive and dependent types, overwhelmed by 
life’s punishments, cannot seem to get out from under, and 
for them psychiatric consultation and therapy are indicated 
and should be advocated as strongly as possible. 

The evolution of therapy for conversion voice disorders 
has arrived at the point where it is now known that mechan- 
ical, symptomatic voice therapy must be cradled in kindness, 
psychotherapeutic counseling, and respect for the unusual 
nature of conversion itself. The history of treatment of this 
disorder has been brutal and ineffectual because of the fail- 
ure to respect and recognize the patient’s potential for 
insight and growth. 

The following case studies show the interaction between 
patient and clinician during interviewing and voice therapy 
for conversion voice disorders. 


Case Study 11.1 


A 26-year-old police officer, on the force for 3 years, had 
never been happy in his work. He had never told anyone 
about his feelings for fear of disappointing his parents, 
who were very much in favor of this vocation. He had 
always been a quiet, unaggressive person who found it 
difficult to lose his temper, but when he did, he shook and 
became livid with rage. He found it difficult to reprimand 
motorists or pedestrians because of a strong need to be 
liked by everyone. The patient had had several previous 
episodes of aphonia and dysphonia, which, heretofore, 
had cleared up spontaneously. At the time of interview, he 
was aphonic. Laryngologic examination was normal. 

Clinician (C): What brings you to this clinic? 

Patient (P): (Patient’s voice is a complete whisper.) My 
voice. | haven’t been able to talk. It started April 9th; it 
came back for a few days and then it was gone again. But 
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it was all high-pitched. And then, finally, that went away 
again. And it has been like this for the last 3 or 4 weeks 
now. | had a bad cold. It started with laryngitis. And my 
voice started to come back again. Then, all of a sudden it 
was gone, just like it is now—completely. Altogether it was 
about 4 or 5 weeks that I lost it. 

C: How did it come back? 

P: | coughed and I had a voice back. The next time, it was 
in either September or October of last year, | had a slight 
cold—no sore throat or anything—but the next day my 
voice was gone again. And that lasted only for about 
9 days. 

C: How did it come back? 

P: | can’t remember exactly how that one did come 
back. 

C: Are you able to make any sound at all? 

P: Once in a while a sound will come out. 

C: Before we go any further, | wonder if you would read 
this aloud for me. (Just before beginning, the patient 
coughs audibly, but reads the passage in a completely 
whispered voice.) Now would you cough for me? (Patient 
coughs repeatedly, each time with an excellent glottal 
attack and brief audible voicing.) 

C: What kind of work do you do? 

P: Police officer. 

C: How long have you been doing that? 

P: Since January of last year. 

C: Have you been happy in your work? 

P: | wonder . . . if | actually do like the work that | am 
doing, or if I am just saying | do because | don’t want to tell 
everybody that | would like to quit and that | don’t want 
the police department. 

C: Disregarding what other people think, how do you 
actually feel about the job? 

P: Well . . . | have trouble giving somebody a ticket. | 
have trouble talking to a person. 

C: What is there about talking to people that is so diffi- 
cult? 

P: | get nervous. | am always afraid that I am going to say 
something or do something wrong... that’s going to get 
either me or somebody else in trouble. Because if you do 
one thing wrong in the police department, you might as 
well forget your job. 

C: In other words, when you are talking to a motorist 
who may have violated a law, you feel under pressure to 
be correct. 

P: | can tell them what they have done wrong, and if 
they don’t start arguing | am all right . . . But once they 
start arguing, then, well, if | get mad | start to shake when 
| get nervous like that. Then my words start to get mixed 
up. | know | shouldn’t get nervous. That is our job, and 
people expect it. It’s just something that happens to me. 
The worst time | got angry, | actually saw red. | got mad 
once, really mad, while | was in a fight, and a guy bit me; 
and then | really got mad. | didn’t think it was possible to 
see red, but | saw red. 

C: That must have been very frightening. 

P: | shook for about an hour and a half after it. 

C: Does being a policeman make you uncomfortable 
because you are in the position of being a disciplinarian? 





P: | never thought of that. I don’t like people to dislike me. 

C: Well, let’s say that you have to apprehend a motorist 
for speeding or some other violation. What are your feel- 
ings? What goes through your mind? 

P: If we see someone we think is speeding, we get in 
behind him and clock him with our speedometer . . . If he 
is speeding, we'll stop the clock which locks the 
speedometer so that we know what speed he is still going 
at, and stop the car and go over and tell him, you know, 
that he has been speeding. Once in a while, you will get a 
guy who will argue, but usually they will go along with 
you. Once in a while, you find a guy who is never wrong. 
His car can’t go that fast. And then, we start to argue and 
then I... I don’t know, | get flustered... 

C: You have the evidence there... 

P: | have the evidence, but yet, I... I know l am right and 
that’s what I don’t get. | know | am right and yet I am the 
one that gets nervous. When actually he should be the 
one. He is getting the ticket . . . | know | am sensitive to 
other people’s feelings about me... how people think of 
me... what they think of me. (At this point, discussion 
was discontinued and voice therapy begun.) 

C: | would like to explain something to you. You haven’t 
really lost your voice. You have temporarily lost the knowl- 
edge of how to make it function again. But the voice is 
there, and through certain activities there is a good likeli- 
hood that we can get it back. First of all, hum for me. (Clin- 
ician demonstrates. Patient does not seem to know how 
to go about producing a humming noise.) Make a cough- 
ing noise. (Patient coughs audibly.) Now do it again, 
except prolong it this time. (Patient prolongs the cough 
into a predominately breathy dysphonia.) Now, did you 
hear that voicing? That’s the voice. And | would like you to 
keep it going. Try it again. (Patient repeats the cough with 
the breathy prolongation.) Now, try it without the cough. 
Hum. (Patient begins to produce an intermittent, weak 
humming noise.) Yes, | hear it now. Again. (Patient 
repeats. After ~15 seconds of humming at the weak, high 
pitches, the pitch drops with a marked tremor, at the 
same time becoming increasingly louder. Suddenly, the 
patient speaks in a normal voice.) 

P: It’s back! 

C: Well! . . . Very good. 

P: Huh! . . . It’s back. (Voice quality and loudness are nor- 
mal.) 

C: How does it feel? 

P: It feels all right (tentatively). It’s not normal. It’s 
higher. A little more gravelly. 

C: This is fairly common, and it will come down to where 
it should be in a reasonably short time. 

P: Because, this happened to me before. It was real 
high, and then it went away again. But it’s not as high as it 
was then. 

C: Would you read this out loud for me, to strengthen it 
a little bit? (Patient reads the passage.) 

P: Wonderful! ... A little funny . . . that it was that easy 
to get back . . . Would it have come back like this if | had 
just walked in here, without talking to you fellows? Or is it 
because I’ve talked to you and the other doctor? 

C: What do you think? 


P: | think so... I mean | think that | might not have got- 
ten it back. 

C: Why do you think that talking to us has played a part 
in your getting your voice back? 

P: Because | know when | was talking to the other doctor 
(the psychiatrist), there is something that | brought up, 
that | started feeling funny. 

C: What was that? 

P: It was about the police department. He asked me 
about... that “you don’t really like your job and you are 
afraid to tell your family this.” And | agreed with him, but | 
felt real funny. 

C: Was it true? 

P: Yes... | have never told anyone. 

C: Having talked about it with somebody, how does it 
make things different? 

P: | shouldn’t be afraid to tell my family. It’s my life. | 
should enjoy what | am doing. | shouldn’t be working for 
something because they all think | should be working 
there. | should be doing something that | want to do. And 
not what they want me to do. That’s what | feel, anyway. 

C: You feel, then, that having said this to someone else 
has changed your attitude. 

P: Right. Because | had thought of it before, but | had 
never mentioned it to anybody. You two fellows are the 
first people | ever told it to. 

C: Was it hard for you to think about? To admit it to 
yourself? 

P: Uh huh. | didn’t want to admit to myself, you know, 
that | didn’t like the job. Like, me, | am a constant day- 
dreamer, and | dream about it. 

C: What would you like to be doing? 

P: My old job. 

C: Which was... 

P: Textile chemist. 

C: Tell me about that. 

P:I’d mix the chemicals, bleach the cloth, dry the cloth... 
| don’t know, | just enjoyed working. It was long hours. The 
pay wasn’t that great. But | enjoyed working there. 

C: What did you enjoy the most about it? 

P: | don’t know if it was because | worked alone. I'd 
maybe say 10 words a day. | didn’t mind it. | sing a lot. I’d 
sing all the time. And | enjoyed it. Or if | wanted to day- 
dream, | could daydream. 

C: What do you daydream about? 

P: You name it (laughs). Mostly about, oh | don’t know, 
money, being strong, you know, crazy things. 

C: How could your loss of voice be related to... . all this? 

P: Well, | don’t know . . . Subconsciously, it was the only 
way | knew of getting away from there . . . maybe 
rebelling. 

C: What do you think would be the solution to the prob- 
lem of the voice loss? 

P: | think it’s the job that’s causing it. Maybe I’m wrong. 
Maybe I’d get another job, and it would go away again. | 
don’t think so. 

C: Do you really think that you will be able to put into 
action what you would really like to do? 

P: Well, | think that now that I have talked to you fellows, 
and | know that it’s because | don’t want to tell my par- 
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ents, maybe now | can tell my parents. Because | was 
always shutting it out before. | never wanted to think 
about it. Maybe it wouldn’t bother them half as much as | 
think it would. 


Case Study 11.2 


A 46-year-old cosmetologist had been under chronic 
stress resulting from several sources: her concern about 
her son, a rebellious, belligerent young man of 25; her 
unspoken rage at her husband, who had never supported 
the family completely and usually didn’t work at all; and 
her chronic irritation with her cosmetology-salon employ- 
ees. She insisted on perfection from them yet was never 
able to voice her irritations when they fell short. Instead, 
her voice faltered into an intermittent hoarseness and 
whisper, and she was overwhelmed with fatigue and usu- 
ally forced to go home and take to her bed. Her marginal 
insight into all this was evidenced by her statement that “I 
am afraid my employees think that | am angry with them 
when | lose my voice and can’t talk.” The patient had had 
many previous brief episodes of dysphonia and aphonia, 
which had cleared up spontaneously. The most recent 
one brought her to this clinic because of its unusually long 
duration (9 weeks). At the time of interview, her voice was 
an intermittent dysphonia-aphonia. Results of laryngo- 
logic examination were negative. 

Clinician (C): Is this your first loss of voice? 

Patient (P): No, I’ve had it many times, but I’ve never had 
it for more than 14 days. 

C: But this last time it’s been longer. 

P: Started my ninth week yesterday. | know one thing; it 
seems that | always lose my voice in October. One of my 
problems is that | have trouble hiring efficient help. 

C: Have you felt at all depressed or unhappy whenever 
you have lost your voice in the past? 

P: I’ve tried to analyze that myself, and | don’t come up 
with the answer. A couple of my better employees say that 
they think that this directly follows mental strain—if | 
become upset over something—but | haven’t been able to 
look back and prove to myself that that’s true. 

C: Does it create any problems, your not being able to 
use your voice normally? 

P: Yes, it makes it so that I’m not really able to cope with 
my business activities. You can’t answer a telephone, peo- 
ple just hang up on you... . | can’t conduct extracurricular 
activities. 

C: What else? 

P: (With great emphasis) | can’t express myself at all. 
There is no intonation in your voice at all. You sound like 
you are angry all the time. 

C: Well... we would like to help you get your voice back, 
but we would also like to learn more about you. Would 
you have any objections to staying on for a few days and 
being interviewed by some of our psychiatrists? 

P: Not at all (surprised). You’re not a psychiatrist? 

C: No. | am a speech pathologist. 
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P: You’re certainly putting a lot into it, psychology-wise 
(embarrassed laughter). 

C: It’s all interrelated, isn’t it? Before we arrange these 
interviews, we would like to help you get your voice back, 
and | think we can start right away. Would you open your 
mouth, now, wide as you can? Now say /a:/. (Patient pro- 
duces a weak, high-pitched, scratchy voice.) Make it 
louder. Now stick your tongue out, Again, now, /a:/. 
(Patient produces a harsh voice a shade louder, which has 
a rapid tremor.) Kind of a tremorous voice, isn’t it, but it’s 
there. Try it again. Let’s do something else now. Tilt your 
head back, now open your mouth as wide as you can and 
stick your tongue way out. Now say /a:/ and keep that up. 
| want you to listen for that sound and try to keep it going. 
Try it again, real sharp now. (Patient produces slightly 
louder tone, momentarily, and is encouraged to listen for 
it and repeat it.) It sounds worse than it is, doesn’t it? 

P: It doesn’t hurt... at all. 

C: Can you hum for me? (Patient produces a very clear 
high-pitched tone on humming.) Oh, very good. Take a 
deep breath and do that again. Can you do that a little 
louder? (Tone clearer this time.) Now try /ma:/. (Patient 
produces a high-pitched, breathy voice, which is an 
improvement over previous productions. She is asked to 
repeat this several times.) Let’s try /ma:/ again. (Patient 
produces this equally well.) It’s still too high, isn’t it? 
That’s not your normal voice. 

P: No. But I’ve never been able to sing for years. (Patient 
continues to produce a variety of vowels with gradual 
reduction of breathiness. However, there is a great deal of 
quavering or tremor to the voice on vowel prolongation.) 

C: Let’s go down one tone, now. (Patient complies, and 
the vowel /a:/ is produced at progressively lower pitches. 
Vocal loudness is also increasing. There is a strained, harsh 
quality, nevertheless.) Very good. Continue. Now let’s 
make it real smooth. . . Let’s try it again. Do you hear the 
way your voice catches now and then? Do you feel a tight- 
ening whenever that happens? 

P: FII... take... better notice. 

C: Now | want you to say this: M-o-n-d-a-y. (Clinician 
produces it in a monopitch. Patient repeats, “Monday,” in 
a monopitch with fairly good clarity, although with some 
harshness and voice tremor. She is asked to produce this 
word at successively lower pitch levels. As she repeats this 
word, and as the voice quality and loudness improve, she 
becomes increasingly tearful, and the voice begins to 
crack.) This improvement in the voice seems to have 
affected you. True? 

P: Yes... It relieves me. 

C: You notice how your voice has dropped down to a 
lower pitch? | think that’s where it wants to settle. Let’s 
see if it gravitates down in that direction. Try it again. 
(Patient repeats sentence.) Yes, there it goes down to the 
lower pitch. That’s fine. Continue. (She is then given a pas- 
sage to read aloud. She does so in a nearly normal voice. 
As she is reading, the clinician intersperses requests that 
she speak louder at various points in her reading. With 
that last reading of the passage, it is apparent to both cli- 
nician and patient that the voice is normal or near nor- 
mal.) How do you feel now? 





P: Tired . . . sore . . . but good! 

C: Go on. 

P: How do | feel?...1 feel...uh... sort of exuberant, | 
am sure...it’s...beenalong...long time. | feel good. 
Now my voice sounds more normal. | feel nice. | feel good. 
(Gives a great sigh.) Now what happens? . . . Is this going 
to happen every once in a while? When | started talking 
again, | always just started talking. | never had to struggle 
like this. | would all of a sudden realize | was talking. | 
would be whispering and all of a sudden | would be talk- 
ing. That feels better. It sounds better, too, doesn’t it? 

C: Is it completely back to normal? Or does it have a way 
to go yet? 

P: Well, | am afraid | have a way to go. My normal voice 
doesn’t take any effort. 

C: Does it still take some effort now? 

P: Yes, it still takes effort. (Patient resumes practice. 
Pitch of voice is gradually decreasing, loudness is increas- 
ing, and quality is clearing. After this, she is asked to 
employ spontaneous speech in an effort to enable her to 
use her voice in more automatic speech situations. She 
describes the furnishing of her business establishment.) 

C: Let’s talk about these voice losses. Do you have any 
warning that you are going to lose it? 

P: | do seem to have a warning when | am going to lose 
my voice ... | have this tight feeling in my chest. However, 
| have never associated it with any happening in my life. 
These are such common, everyday things that it might 
just take an accumulation to make a person, all of a sud- 
den, fight back some way or another, you see? 

C: By “fight back,” what do you mean? 

P: Well, my husband says, and | think he has got it pretty 
straight, maybe, that | get so sick and tired of talking to 
women day in and day out.. . . | like my work, but he says | 
get so sick of women talking and yakking and the daily 
noise around me, that finally | make up my mind I’m just 
not going to talk. Well, he might have something there. | 
don’t know. Maybe things pile up on me or something, ’til 
I'll just fight back by not talking. l'Il show them. | don’t 
know. Day in and day out, we have to talk to people and be 
nice to them, which, of course, we want to be, and, uh, if 
someone comes in and throws themselves into your chair, 
starts to be a little bit obnoxious, she’s had an argument 
with her husband, or she didn’t like the news on the radio, 
or one of her children acted up before he got on the 
school bus... You know there is something in the back of 
her mind, so the first thing you do is try to kind of butter 
her up a little bit. Now, it’s a very very trivial thing, but 
uh... when you’ve done this about 10 times in 1 day you 
get sort of sick of it after a while . . . and there is one thing 
that | cannot do when | don’t have a voice, | can’t do that! 

C: Do you think that your voice loss comes about after a 
series of stressful events? 

P: | have asked myself this same question many times 
before. As | have said, one of my valuable employees has 
told me that she has noticed over the years that every 
time | have a real nervous spell . . . not every time . . . when 
| have a real nervous time, perhaps not just one day of it, 
that | oftentimes get laryngitis. So, since she told me that, 
and | had never heard her voice this until this last siege of 


laryngitis, | have been trying to think that through and see 
if she is right. If this is true, | still don’t know how to keep 
from having it happen again. (Patient sent for psychiatric 
consultation at this point. Resumes next day.) 

C: Tell me, what did you do after your appointments 
yesterday? 

P: (Patient’s voice virtually normal. She is in good spir- 
its—almost childlike attitude; many pitch and inflectional 
changes; generally a great deal of melody to the voice.) 
Well (laughs), we went down to one of the restaurants, 
had a bit piece of cake, and celebrated, and we made up 
our minds to hunt for one of my husband’s friends living 
near here. | was tempted to go back to my room and go to 
bed, but | didn’t think that would be smart. | felt that | 
should keep talking, and driving in my husband’s truck | 
would have to talk loud and keep talking, and so, | just 
talked over the noise of the motor, and | think my husband 
helped me along. He asked me everything twice (laughs), 
on purpose, | think. 

C: Is he pleased with your voice? 

P: Yes! | should say so. 

C: So, you feel that your voice is normal now? 

P: Uh huh. 

C: You feel well? 

P: Uh huh. 

C: Is there anything that you would like to ask about? 

P:No...uh...! would like to know about... about my 
psychoanalyst [sic] yesterday . . . | find that very interest- 
ing. I’d just like to know a little bit more about myself. 

C: | talked with him [psychiatrist], and he confirmed 
what we had discussed yesterday, that your voice loss is 
related to your way of reacting to the stresses around you. 
He thought that you have the habit of bottling up your 
emotions, instead of allowing yourself to express how you 
feel, like blowing off steam. 

P: That | know about myself. 

C: Do you have any idea why you can’t? 

P: | have always wondered about myself in this respect. | 
never can express love or emotions of that sort, either, 
without forcing myself. That’s probably why | don’t blow 
up, too... Now, I am sure that | feel just as deeply as any- 
one, but I just don’t talk about it. 

C: Do you know why this is true? 

P: No, unless it’s my upbringing. My parents were very 
close to each other, and they never argued. | never heard 
them argue in my whole life. | see this as something 
unusual. However, neither did | ever see my father kiss my 
mother more than maybe a couple of times in my life. | 
mean, there was no outward expression of love. | think 
maybe | was sort of ashamed to express myself. | wasn’t 
brought up to wear my emotions on my sleeve. 

C: How do you feel toward people who do express 
themselves in these ways? 

P: Oh! | think it’s wonderful! | really do. That is the only 
way to live. 

C: Do you wish, perhaps, that you could live that way? 

P: Yes, | think maybe that those around me would like 
me better. 

C: Which of your parents do you think kept their feelings 
to themselves more? 
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P: Mother did. Very much. 

C: Were you close to your mother? 

P: (Emphatically) Yes! My father was 20 years older than 
my mother. He just worshipped me. | know that. But | was 
still closer to my mother. 

C: Do you feel that you’re like your mother? 

P: Yes. Uh huh. Everyone does. | mean, in appearance, 
and they say the older | get the more | look like her and the 
more | act like her. | always felt that | couldn’t be any bet- 
ter than that. 

C: Do you think that you are like her also in the respect 
that you don’t show your feelings externally? 

P: | suppose you are right! You know, | never thought of 
that? | never thought of being emotionally like my mother. 
| presume that | have often expressed the desire to be. 
Because she was a person who was always so well liked and 
never seemed to resent anyone. | never heard her say any- 
thing unkind about anyone in my whole life. Women 
would come to our house and gossip, sometimes, and 
mother would clam right up and not say a word. For a 
woman not to ever gossip is almost unheard of. 

C: Do you think she allowed herself to be taken advan- 
tage of? 

P: Yes, | think she did. 

C: Do you think you are like that? 

P: Not as much as she was. | let people run over me at 
times. That’s probably half my trouble in my business. | 
will let someone keep getting away with something until 
it builds up too much, | suppose. | should not have let 
them get away with it to start with. But rather than put- 
ting my foot down and saying, “Now look, we don’t do 
things that way,” that this is it, | would let them get off 
easy. 

C: Well, why don’t you? 

P: (Laughs.) I am going to try, l'Il tell you. 


Mutational Falsetto (Puberphonia) 


Despite anatomically and physiologically normal respiratory 
and phonatory structures, mutational falsetto is produced 
with the larynx positioned excessively high in the neck and 
vocal folds lax and stretched thin anteroposteriorly. Phonation 
is produced with abnormally reduced infraglottal air pressure 
caused by shallow respiration. The voice is high or intermedi- 
ate in pitch, weak and thin, breathy, hoarse, and gives the 
impression of immaturity, passiveness, and effeminacy. 

Voice therapy begins with asking the patient to breathe in 
deeply and to produce a vowel or cough with a sharp glottal 
attack, as demonstrated by the clinician. The pitch will break 
abruptly to a lower level. If the sharp glottal attack is unsuc- 
cessful, one should manually pull the larynx downward as 
the vowel is being produced. If these measures fail, while 
holding the larynx down, one should depress the tongue 
with a tongue depressor as the patient produces the sharp 
glottal attack. Alternatively, asking the patient to yell as if he 
needed someone from the other end of a football field may 
bring about a lower voice because he can’t yell at the high 
pitch. 
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The low-pitched voice should be identified immediately 
for the patient. If the disorder is actually mutational falsetto, 
it is important to explain early in therapy that the low- 
pitched voice is the normal one and that the patient must 
learn to think of it as the target voice in therapy. This infor- 
mation must be conveyed persuasively, because the patient 
will not be completely convinced until after habituation of 
the low-pitched voice. 

As soon as the low-pitched voice is produced, it should be 
maintained through repeated vowel production using a 
sharp glottal attack. The therapy is not a gradual stepwise 
progression from the high to the low pitch. The break must 
be sudden and complete. Phonation must be forceful and 
vigorous. The voice will want to shift up to the falsetto, in 
which case the patient will need to be reminded to breathe 
in deeply and then phonate sharply. Increased depth of 
breathing is fundamental to success in therapy, and the 
patient needs practice in phonating after deep inhalation. 
The clinician judges on a moment-to-moment basis how 
long to remain on vowels before attempting words, sen- 
tences, and conversational speech. 

In a study of the pitch and quality characteristics of 10 
adolescent males aged 13 to 18 years and two adult males, 
ages 26 and 29 years, who had mutational falsetto, Hammar- 
berg (1987) found that the mean change in fundamental fre- 
quency in 10 of the subjects between 13 and 18 years of age 
before and after therapy was 221 Hz (range, 168 to 288 Hz) 
and 119 Hz (range, 105 to 135), respectively. In the two males 
26 to 29 years of age, the mean fundamental frequency prior 
to therapy was 198 Hz and after therapy 100 Hz. 

A reasonably stabilized voice should be achievable within an 
hour. Three to five additional therapy sessions are usually nec- 
essary to effect habituation. The most difficult phase of the 
therapy for mutational falsetto is getting the patient accus- 
tomed to the low-pitched voice. Gradual guidance into outside 
activities is necessary. Embarrassment upon returning to school 
is common among adolescents with mutational falsetto, and 
for that reason, therapy during vacation is desirable. 

Patients who have mutational falsetto, for reasons of 
immaturity, lack insight into the reason for their voice disor- 
ders and even when needed often make poor candidates for 
counseling and psychotherapy. Fortunately, in most cases 
the cause is no longer operative by the time the patient 
comes for voice therapy. A complete and permanent low 
pitch can be attained without delving into either the psycho- 
logic or physiologic reasons for the failure of voice change. 
Occasionally, despite their ability to produce the low- 
pitched voice, certain patients will refuse to proceed with 
the therapy because they are threatened by the prospect of a 
masculine voice. Discussion with the patient and parents 
may result in a referral for psychologic or psychiatric ther- 
apy. Such patients, however, often return for therapy later 
after having become accustomed to the idea of the lower 
pitch. 


Case Study 11.3 


The typical sequence of events in therapy for mutational 
falsetto is given in the following study of a 21-year-old 
male college student. 


Patient (P): | guess it was in the ninth grade when my 
voice first started changing. Seems like it started growing 
in spurts. The first time | was really aware of it was in 
stressful situations. We had an English class where you did 
a lot of book reports, and I'd try to get up and give a report 
and it would be long periods where it wouldn’t break and | 
couldn’t get anything out. | just kept getting worse and 
worse. At first, it would crack a lot. And a lot of people 
would say it was so shrill. It would sound just like a girl’s, 
that high. Then through high school, in the tenth grade, it 
was still the same way, but in the twelfth it got to where it 
wasn’t as shrill and it didn’t break quite as much. Like, 
when a substitute would come and call the roll, and I'd yell 
“here,” it wouldn’t even come out; I’d have to try it twice. 
One guy where I work said, “Whenever you get excited 
you sound just like a girl.” When | was relaxed, it would be 
hoarse. After swimming, if | was cold, tired, or really nerv- 
ous, it would get much worse. When | got to college | 
could try to make it where it wouldn’t crack, but it still 
sounded hoarse. 

Clinician (C): Do you have some confusion in your mind 
as to which is the right voice? 

P: Yeah. But the cracking didn’t bother me a great deal. 
We’d moved about six or seven times, and in the seventh 
grade it was really traumatic. | was extremely shy. | was so 
shy it was really incredible. | don’t think | ever went any- 
where in the seventh grade except to school and then 
straight home. In the eighth grade, | had a pretty low self- 
image. But | think I’ve grown out of it a lot. | had a low self- 
image in the seventh and eighth grades and | was shy, and 
then it began cracking in the ninth grade and that about 
shot me down. There wasn’t a lot of school-ground taunt- 
ing, but, like, if you’d answer the telephone, you’d know 
what people were really thinking. 

| put up with it through the whole ninth grade, and 
then we ended up going to about three or four throat 
specialists, and they said the voice would change. The 
last doctor | saw was a year and a half ago; he said it 
would change, too. Physically | was in real good shape, 
and | had a real good home life; it was my only refuge. | 
always wanted to be like my father. Do you think it was a 
fear of the cracking that made me stay at the high pitch? 
| think it is possible. 


Therapy 


C: Have you ever noticed that when you shout, your voice 
will drop to a lower pitch? 

P: Yes, sir, when | shout I’ve noticed it. | have trouble 
shouting, though. 

C: The first thing | think you ought to know is that the 
low-pitched voice is the normal voice. For some reason 
you haven’t been using enough muscular effort in exhaling 
for voice, and if you have a weak flow of air when you 
speak, it has a tendency to produce this high pitch. But 
when you get good pressure under it, the voice will go 
down. There is a second factor—how forcefully you are 
bringing the vocal folds together. And I'll show you how we 
can accomplish that. | want you to sit up straight and do 
this for me. (Clinician demonstrates a sharp glottal attack. 
Patient repeats, and the voice drops to a low, masculine 
pitch.) See what | mean? Do it again. Now | want you to 


hold it out as long as you can. Now produce the old voice. 
(Patient does at the high pitch level.) Hear the difference? 
Do you notice how much less effort you have to exert to 
make the high-pitched voice? Try the low one again, sus- 
taining the vowel. There is an interesting little quiver or 
warble to the voice which may be a sign that you haven’t 
been using the vocal folds and they are a little bit flabby. 
Now say “Monday.” (Patient says the word, the first syllable 
at the low pitch, the second at the high. With slightly more 
effort on the second trial, the entire word is produced at 
the low pitch.) Say “one Monday,” sharply. Now, “one 
Monday morning”; take a deep breath first. Now we'll do 
some exercises. (Clinician demonstrates five vowel sounds 
in short succession, each produced with a sharp glottal 
attack. Patient repeats all at a low pitch level. This exercise 
is repeated for several minutes. After that, the patient is 
asked to count to 10 sharply.) Sound all right? It didn’t 
crack, did it? Now say the days of the week the same way. 
Notice that when you slack off on the effort, the pitch goes 
up? Get into the habit of breathing deeply and putting 
forth effort when you talk. Now, pick up that card and start 
reading with that voice. (Patient does so with very few 
falsetto pitch breaks.) Not bad. All right, now let me hear 
you start counting and see how far you can go. Now I’m 
going to ask you some questions, and | want you to answer 
in a low-pitched voice. Remember, sharp and loud. How 
old are you? (Patient answers 21 in a falsetto voice.) Let’s 
go back to the vowels. Produce them sharply. Now, how 
old are you? (Patient responds in the low-pitched voice.) 
Are you getting the feel of it? Now tell me about school, 
where you go, what subjects you’re taking, and so on. 
(Patient begins to describe his college experience some- 
what hesitantly, but in the low-pitched masculine voice.) 

C: Well, what do you think of your voice now? 

P: Strange. | just can’t believe that was it. 

C: It sounds completely normal to me, but | know it 
sounds peculiar to you. It’s not cracking, is it? Do you like 
the new voice? 

P: Yeah (somewhat tentatively and quizzically). 

C: Well, just keep talking and get some experience with it. 

P: | love to play handball. I’m starting to get halfway 
decent in gymnastics. At first, | was going to be a business 
major, but it wasn’t for me. So | decided on physical ther- 
apy. I think I'll be real good at that. 

C: A while back you said your mother was an unusual 
person. Could you tell me more about that? 

P: | guess | consider her pretty unusual. I’ve had a pretty 
warped view of people in general, twisted, well, not really 
warped; | saw a lot of cruel things. My mother is real sensi- 
tive. She teaches children with learning disabilities. And | 
think she is probably one of the best in the state. People 
can be callous and they can be cruel, at school, mostly 
toward me, but to other people who were shy too. She has 
a great deal of empathy. She’s kind of headstrong though. 
My father, he didn’t have a lot of time, say, when we were 
growing up. You know, the old deal about making money. 
And about 2 years ago, he changed more than any person 
ld ever seen. He was pretty dogmatic and rigid, and he 
became more understanding and more open, just less 
structured, kind of seemed more human. He’d played ona 
basketball scholarship and | really wanted to be like him, 
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quite a bit. With my sister, we fought a lot. She is real 
pretty. The guy she married, we just don’t make connec- 
tions. I’m not kidding, he’s really a strange character. (All 
the above is produced in the low-pitched masculine voice, 
with few pitch breaks. His rate of speech progressively 
increases as he becomes more comfortable with the new 
voice.) We got real close since she has been married. We 
really have fun with her now. Except, | think she could have 
reached a higher potential if she had married someone 
else. She could really write. In English classes, her name 
would come up a year later when | would take those 
classes, and her teachers would say, “She could really 
write. | hope you’re as good as she is.” 

C: You sound as if you are a sensitive person and that 
you have high standards. 

P: I'd like to believe that. 

C: Do you think that you’re oversensitive? 

P: Definitely. As a matter of fact, about 3 months ago, | 
finally told mother—like that one doctor, he’s crazy, think- 
ing that my voice was still going to change, and it got to 
where | was becoming callous. I’d put up with the voice for 
7 years, and | didn’t have too much patience with people. 
It might be that | didn’t try to understand them because | 
was too wrapped up in my own problem. Like the first two 
doctors, | thought they had to be right, and then the best 
one in the state said that at 19 | just needed more time! | 
was pretty well disgusted. Mother asked me once if | ever 
forgot about my voice, and | said, “No way.” (At this point 
the session was terminated, and the patient was asked to 
return the following day.) 

C: What were your parents’ reactions to your voice? 

P: Well, | called my mother on the phone and | didn’t tell 
her who | was at first, and when she realized who | was 
(laughs), she broke down and started crying quite a bit and 
said that she didn’t think she would even recognize me 
until | told her. | almost didn’t sleep last night, and when | 
woke up this morning | had to concentrate quite a bit to 
get the voice started. My dad said it was deeper than it was 
yesterday, and we were outside shopping and it was cold 
and | said a few words and they came out the old way, but | 
caught it right away. That was the only time it ever slipped. 

C: Does it sound more normal to you now? 

P: Yeah. | went over to start talking to Daddy and | 
almost got lightheaded; | wasn’t used to breathing that 
deeply. | used it in restaurants and shopping. It was fun. 

C: Do you have any other impressions of the voice? 

P: Not really. | told Daddy the whole story. | just told 
him | couldn’t believe it, and | don’t guess he could either. 
Even he said he couldn’t hope for that much. Because 
they had both been preparing me for partial improve- 
ment, maybe having to go home and work at it, and it 
would take maybe a few months. | thought to myself that 
| don’t know if I’m going to be able to go back and talk to 
my friends and relatives. | guess it was concern about the 
hassle of drawing attention to it. Then, | thought that 
after talking to Mother and Dad, that that’s the farthest 
thing | should be worrying about. In fact, all my aunts and 
uncles and cousins are coming over Friday, and | can 
hardly wait. But | had thought about it, that it would be 
some hassle. 

C: Do you feel satisfied? 
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P: Totally. It’s more than you could really hope for in 
your wildest dreams. When | was in high school, people 
would treat me like | was a poor little kid. You know, | can’t 
go back to it, but | really could have been a fantastic bas- 
ketball player. | could have been on a scholarship now, but 
you know, | just kind of copped out. You know, my mother 
told me once that she thought I’d gotten used to the 
voice, and | just couldn’t believe it. And | said, “Mother, 
you’ve never been to a place where you have to give a 
speech or talk to a stranger.” For a while it dictated my 


whole life, what I’d do and what I'd try. 


The patient’s father wrote 2 months later: 

Prior to the change in Mike’s voice, | felt that he was a 
shy, sensitive person with a very poor self-image. Dur- 
ing Mike’s early years, | believe he was what | would 
call “painfully shy.” Mike was not a confiding person, 
and it was difficult to get him to engage in an open- 
ended discussion, what | would call a “bull session.” 

The change in Mike’s voice has brought about a pro- 
found change in his view of himself and his relation- 
ship to other people. Since things are now so much 
more positive, Mike opens up much more easily and is 
quite willing to discuss anything and everything. 

Prior to Mike’s session with you, | was very con- 
cerned about his future. | now feel that his confi- 
dence and self-image have improved so sharply that 
he will reach his potential as a person and will have a 
rewarding life. 

Mike now participates in class discussions. He had 
told us that during 4 years of high school and 3 years 
of college, he had never before done so. 


The patient wrote the following letter 3 months after 
acquiring his masculine voice: 


Most things don’t measure up to their expectations. 
My new voice was one that did. | was in a state of bliss 
when I returned to . When I got home my 
mother and sister were the first people | talked to. | 
described every detail and emotion of my trip to the 
clinic. 

When | first came in contact with people outside 
my family, it was an almost laughable situation. I’d 
say something and their faces would register shock, 
but they’d never comment about my changed voice. 
| assume that in the past, by my action and behavior, 
it was well understood that my voice embarrassed 
me and not to mention it. Butit was so unusual to me 
that not a single person outside of my family has ever 
mentioned or commented on my new voice. And for 
once in my life | was more than willing to discuss it. As 
a matter of fact, | wanted to talk about it. | have finally 
come out and said a few things to my friends, who 
would immediately be full of questions. We ended up 
having very enlightening conversations. 

The peer pressure was incredible. I’ve never even 
so much as flinched before it. Nothing could force me 
back to the misery that was before my trip. But | can 
see how powerful role expectations are and to have 
such a major change in not only voice but identity. 
Because, too often after hearing my voice people 


tacked on behavior traits to me as well. They 
expected me to act gimpy, since | talked gimpy. | 
don’t know if anyone else can understand how all- 
encompassing my voice was to me. My father asked 
me when | had returned if | thought my change of 
voice would have a very major effect on my life. | 
could have cried. It was not part of my life, it was my 
life. Every action | made, every decision, opinion, all 
of my behavior was what it was because of my voice. 

It takes a long time to get everything straight- 
ened out. To this day if a phone rings and |’m by 
myself l'Il find myself trying to warm up my vocal 
cords by talking to myself. The behavior patterns 
don’t fall by the wayside as rapidly or as painlessly as 
did my voice. There’s a whole way of life involved in 
coping with my voice since it bothered me so much. 
And it takes effort and desire to overcome that way 
and replace it with another. In a class once a fellow 
student made a comment that no one knew what 
Hell was like. | smiled to myself, because in all hon- 
esty and sincerity | felt | was living a Hell and nothing 
could be worse. 

PII be going to a new school this semester. My 
whole life is one uphill swing since the clinic. Every- 
thing keeps getting better. After visiting four special- 
ists who did nothing, | had a very low opinion of doc- 
tors. You raised that opinion, and | thank you for your 
competence. After saying that it was a living Hell | 
must also say that my voice gave me a unique per- 
spective on life. I’m glad that | am what | am. I’m 
thankful that | can relate to unfortunate people—to 
someone with a handicap. That I’m not someone 
who has only tunnel vision because they can’t see 
outside of their own experiences. 


Spasmodic Dysphonia 


One of the most complex voice problems encountered by 
speech clinicians is spasmodic dysphonia. Because of its 
complexity, a separate chapter is devoted to description and 
treatment of its various forms. 


Unilateral and Bilateral Vocal Fold Paralysis 


Incomplete glottic closure during phonation because of 
unilateral or bilateral adductor vocal fold paralysis pro- 
duces variously aphonia, breathiness, hoarseness, exces- 
sively low volume, diplophonia, and high-pitched falsetto. 
Therapy is based on the principle of compensatory glottic 
closure by capitalizing on the primitive effort closure capa- 
bilities of the larynx. In unilateral paralysis, the normal 
vocal fold is encouraged to cross the midline and contact 
the paralyzed fold. In bilateral, both folds are forced toward 
the midline. Voice recovery depends upon the etiology of 
the paralysis and the position of the paralyzed vocal 
fold(s). Patients who have degenerative peripheral nervous 
system diseases are poorer risks than are those with static 
lesions. Those who have unilateral vocal fold paralyses are 
better therapy risks than are those who have bilateral 
paralysis. The closer the paralyzed vocal fold is to the 
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midline, the better the prognosis. Patients who have idio- 
pathic vocal fold paralysis may experience spontaneous 
return of voice. 


Voice Therapy 


Behavioral voice therapy for vocal fold paralysis is based pri- 
marily on the patient’s potential to compensate by means of 
adduction of the intact vocal fold. The following methods 
have found widespread support among clinicians: 


¢ Effort closure techniques. In Chapter 2 effort closure was 
described as a primitive reflex mechanism. Stronger glot- 
tic closure is facilitated by capitalizing on the effort clo- 
sure reflex by means of grunting, controlled coughing, 
laughing, pushing, and lifting. Linking the fingers and 
pulling in opposite directions while phonating the vowel 
/i/ and pushing against a table or pulling up on a chair 
synchronously with phonation are activities that were 
originally advocated by Froeschels, Kastein, and Weiss 
(1955). Vowels are introduced with the vowel /u/ having 
particular value in producing better voice. Progression 
from vowels to syllables, words, and sentences follows. 
Singing and humming are additionally helpful. Not all 
patients who have vocal fold paralysis require strenuous 
pushing, for many clinicians report that initiating vowels 
with an abrupt, sharp, or hard glottal attack is often 
sufficient. 


e Excessive musculoskeletal tension that may have devel- 
oped as a secondary reaction should be eliminated (see 
the Musculoskeletal Tension [Vocal Hyperfunction] sec- 
tion of this chapter). 


¢ A trial-and-error search should be conducted for a pitch 
and loudness level that will yield the best voice quality 
with the least physical effort. Pinching the wings of the 
thyroid cartilage together gently during phonation will 
bring the vocal folds into closer proximity, producing 
improved voice, which may then be repeated without the 
therapist’s help. Turning the head to the right or left may 
also improve the voice by increasing the tension on the 
paralyzed vocal fold. 


¢ Sensitizing the patient to improvements in voice during 
trial-and-error procedures should be emphasized. 
Moment-to-moment voice changes during therapy need 
to be pointed out to the patient instantly; otherwise, they 
will escape notice. 


¢ Attention should be paid to secondary personal and emo- 
tional problems arising from the voice disorder. One 
should provide time during each therapy session for dis- 
cussion of the patient’s voice experiences in everyday life, 
encouraging ventilation of frustrating experiences. It is 
important to be attentive to more severe emotional reac- 
tions, particularly depression, and to be ready to consult 
with appropriate mental health personnel. 


Phonosurgical Treatment 


An alternative for patients with unilateral vocal fold paraly- 
sis who do not respond to voice therapy is some type of 


phonosurgery as described elsewhere in this chapter. Of par- 
ticular interest to this population are medialization proce- 
dures and reinnervation. Although not yet tried in humans, 
gene therapy has been successfully used to treat laryngeal 
paralysis in rats (Shiotani, Saito, Araki, et al, 2006). 

Following are typical experiences with patients who have 
had vocal fold paralyses. 


Case Study 11.4 


A 45-year-old widowed mother of three children, a clerk in 
a department store, had no previous history of voice dis- 
orders, when she awoke one morning with marked 
hoarseness. 

After 2 weeks, she consulted a laryngologist, who 
found aright unilateral vocal fold paralysis. She had no his- 
tory of upper respiratory infections or trauma. Neck pal- 
pation and chest radiographs failed to reveal any masses. 
Remaining cranial nerves were normal. The laryngologic 
diagnosis was idiopathic unilateral vocal fold paralysis. 

Her voice was hoarse and diplophonic. Glottal attacks on 
vowels and coughing were weak. Peripheral speech muscu- 
lature examination failed to disclose any other abnormali- 
ties; neither did the motor speech evaluation. Her voice 
had not improved during the 2 weeks since onset, and she 
had taken leave from her job because of difficulty in com- 
municating with customers, especially in noisy settings. 
She became easily fatigued, both because of the increased 
vocal effort and because of the need to breathe more fre- 
quently owing to air wastage during phonation. She was 
having trouble talking to her three active children. 

A series of voice-therapy sessions was begun, stressing 
sharp glottal attack activities. She began to show reduced 
hoarseness and diplophonia while phonating at slightly 
higher pitch levels. However, when she increased her 
loudness level beyond a certain point at those pitch levels, 
diplophonia reappeared, so that therapy necessitated 
that she learn to modulate the loudness of her voice. A 
home-therapy program was outlined, and the patient was 
seen biweekly. During the first month of therapy, her 
voice underwent stepwise improvements, the first 
observable change being a disappearance of diplophonia. 
She returned to work at that point, enduring without 
undue concern the well-meaning remarks of fellow 
employees and customers concerning her voice. She 
managed to communicate with others by standing closer 
to them while speaking. 

Her voice continued to improve during the next 3 
months, and 4 months after the beginning of therapy, she 
was dismissed with mild residual breathiness. Loudness 
and pitch range had become acceptable, if not normal. 
The final laryngoscopic examination showed normal bilat- 
eral vocal fold movements. 

Spontaneous return of voice is common in patients 
with idiopathic unilateral vocal fold paralysis, although 
voice therapy may facilitate such recovery. The decision to 
begin therapy in this case was a precautionary measure 
predicated on the assumption that the patient might not 
have had spontaneous improvement. 
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An experience totally different from the above study is the 
following. 


On the pitfalls of injecting Teflon into mobile vocal folds 
such as for the correction of atrophy and bowing of the 


Case Study 11.5 


A 53-year-old housewife and cashier at a coffee shop 
emerged from anesthesia for thyroidectomy with a severely 
breathy-hoarse voice that did not improve during the 2- 
month postoperative period. She returned for laryngologic 
examination and was found to have a fixed left vocal fold. 
She returned to work a month after surgery but met with 
considerable difficulties in her job, being unable to make 
herself heard in the noisy restaurant and becoming increas- 
ingly self-conscious about the steady stream of comments 
from customers regarding the sound of her voice. She and 
her husband made no attempt to conceal their hostility and 
were determined to get help for the voice. 

The laryngologist, after explaining to the patient the 
unavoidability of the paralysis, and after demonstrating 
much concern over her welfare, referred the patient for 
voice therapy. On evaluation, she was breathy, bordering 
on aphonia. She broke down several times during discus- 
sion of the effects of her voice on her job and home life, 
stressing the embarrassment it caused her and how it 
made her feel self-conscious and “peculiar.” 

During 4 months of voice therapy, her voice failed to 
improve. Finally, one day the patient’s husband called, 
saying that his wife had gone on an extended leave of 
absence from her job and was showing signs of increasing 
depression and withdrawal. 

The patient was referred to a psychiatrist, who 
believed that the depression was a reaction to her illness, 
surgery, and voice disorder and placed her on antidepres- 
sant medication, agreeing to see her periodically for sup- 
port. Voice therapy was continued simultaneously, and 
although she appeared less depressed, her dysphonia 
failed to improve. 

Review of her case with the laryngologist led to the 
decision to do a Teflon injection of the paralyzed vocal 
fold. She underwent general anesthesia, during which the 
left vocal fold was injected. Upon recovery and reexami- 
nation 24 hours after surgery, her voice was virtually nor- 
mal in quality, loudness, and pitch. Her response was 
ecstatic, her depression had lifted almost instanta- 
neously, and comparison of pre- and postsurgical video- 
tape recordings showed an indescribable difference in 
facial expression. 

The patient returned to work immediately, and her hus- 
band reported that her mood had changed back to normal. 
Six-month and 1-year follow-up examinations showed con- 
tinuation of normal voice and personal adjustment. 

In contrast with the first patient, whose personality 
strengths and somewhat better voice enabled her to tol- 
erate her dysphonia, the second patient fared less well. In 
addition to her increasing depression over her voice loss, 
necessitating psychiatric treatment, her normal vocal fold 
gave little indication that it intended to compensate. The 
effects of the Teflon injection, affording a new voice and 
its concomitant emotional benefits, illustrate the virtue of 
medialization procedures. 


folds, Kaufman (1988) wrote: 


Teflon in the mobile vocal cord diffuses and produces 
a stony-hard cord. The resultant voice may be tem- 
porarily improved, but with the passage of time, the 
voice becomes very poor, indeed worse than prior to 
injection. Over the past 5 years | have attempted to 
surgically remove Teflon in six such cases, using the 
CO, laser. The diffusion of the Teflon makes this 
impossible, and the vocal improvement following 
removal is modest at best . . . no matter how appeal- 
ing it may seem to use that technique to correct such 
cordal defects, no available data supports that prac- 
tice. Furthermore, irreversibility once Teflon is intro- 
duced makes such injections ill advised . . . at present 
there is no surefire method to correct bowing of the 
vocal cords, least of all Teflon injection. 


Paradoxical Vocal Fold Dysfunction 


Paradoxical vocal fold dysfunction (PVFD) occurs when the 
vocal folds paradoxically close when they should be opening. 
Paradoxical vocal fold dysfunction (PVFD) occurs when the 
vocal folds paradoxically close when they shoud be opening. 
This condition is called by many names including paradoxical 
vocal cord dysfunction (PVCD), vocal cord dysfunction (VCD), 
irritable larynx syndrome (ILS), episodic paroxysmal laryn- 
gospasm, Munchausen's stridor, factitious asthma, and many 
other names. The etiology has been attributed to brain-stem 
problems, performance anxiety, redundant or floppy laryngeal 
tissue, vagal disturbances, and esophageal reflux (Mathers- 
Schmidt, 2001; Sandage and Zelazny, 2004). PVFD has 
occurred in an infant as young as 4 months and in the geriatric 
population though it is most common in adolescent athletes 
and women between the ages of 20 and 40. Symptoms may 
range from mild sporadic breathing problems to complete res- 
piratory obstruction. Approximately 50% of the individuals 
with PVFD also have asthma. Most paradoxical movements 
occur during inspiration, but in ~10% of the cases it occurs dur- 
ing expiration. With paradoxical breathing, regardless of etiol- 
ogy or age, patients often experience air hunger and then panic 
with a fear of impending death. The problem mimics asthma, 
and commonly patients are referred by allergists reporting 
that the patient has breathing problems, with or without 
accompanying hoarseness, that are not responsive to asthma 
medications. If pulmonary function tests have been ordered, a 
typical flat inspiratory flow volume loop is observed. When 
there is not a concomitant medical or psychologic problem, 
treatment is generally straightforward and successful. Patients 
are taught to inhale slowly through their nose to maximize 
glottal opening and exhale through a restricted oral aperture 
such as created with pursing the lips or producing a soft /s/. 
Once the patient is aware that he or she can control the airway, 
the focus of therapy is on teaching abdominal breathing and 
overall relaxation. In cases where the PVFD is exercise- 
induced or occurs only during an athletic event, treatment also 
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includes teaching breathing patterns that complement the 
sport and help reduce anxiety. In those instances when redun- 
dant tissue is the cause, surgical expiration of the tissue may be 
necessary. When anxiety is the primary cause, psychiatric 
referrals are necessary. 


¢ Therapy for Problem Voice Disorders 


The therapy principles and methods already described are 
for those voice disorders about which there is a moderate 
amount of knowledge and certainty. In this section are dis- 
cussed voice disorders about which less or little is known. 


Neurologic Voice Disorders 


Aside from therapy for unilateral and bilateral vocal fold 
paralysis resulting from lesions of the tenth cranial nerve, 
which are amenable to surgical and behavioral therapy, most 
voice disorders due to central nervous system diseases are 
resistant to modification. In the majority, the dysphonia does 
not exist alone but is a component of a dysarthria involving 
resonation and articulation as well. Respiratory weakness 
and incoordination are inseparable from these defects of 
phonation and contribute heavily to the voice disorder. 


Myasthenia Gravis 


Improvement of the breathiness, hoarseness, and insuffi- 
cient loudness associated with this disease is directly 
dependent upon alteration of the neurochemistry at the 
myoneural junction. Patients who respond successfully to 
the administration of anticholinesterase drugs or to thymec- 
tomy will correspondingly experience an improvement in 
their voices. The velopharyngeal insufficiency of myasthenia 
gravis can be considerably improved and even eliminated 
with the use of a palatal-lift prosthesis. Symptomatic voice 
therapy for patients on maintenance doses of anti- 
cholinesterase drugs has not been advocated because of 
questionable effectiveness; however, more knowledge is 
needed in this area, and the effects of voice therapy need to 
be clinically investigated. 


Pseudobulbar Palsy 


The strained-strangled hoarseness and excessively low pitch 
of spastic (pseudobulbar) dysphonia, a component of spastic 
dysarthria, can be a serious obstacle to speech intelligibility. 
The dysphonia is especially severe when accompanied by a 
flaccid component in amyotrophic lateral sclerosis. The accu- 
mulation of saliva in the pyriform sinuses of the larynx in 
amyotrophic lateral sclerosis worsens the dysphonia. 


Parkinsonism 


The breathiness, monopitch, and especially the inadequate 
voice volume can severely compromise speech intelligibility 
in this disease. In the not too distant past, the value of voice 
therapy was considered limited and effective primarily for the 


mildly impaired. Since the development of the LSVT, this is no 
longer the case. Many patients undergoing LSVT, drug therapy 
with L-dopa, and deep brain stimulation experience an 
improvement in their voices along with improvement of other 
signs of parkinsonism. A portable, body-worn sound amplifier 
has been used, as has a vocal intensity controller, with variable 
success, but as these feedback devices become increasingly 
sophisticated, this too is likely to change and to complement 
the other treatments. Some patients can become aphonic. 
Here, as in other very severe communicative disorders, con- 
sideration of a communication aid should be given. 


Essential (Voice) Tremor 


Although they suffer social embarrassment, patients who 
have mild to moderate essential (voice) tremor do not usually 
have an intelligibility problem. Voice therapy is frequently 
ineffective in reducing the tremor, which is unfortunate, 
because the tremor is socially restricting for many patients. 


Other Less Common Movement Disorders 


The waxing and waning laryngeal spasms and uncontrol- 
lable pitch changes, which are slower in the dystonias and 
faster in chorea, generally have not been treated successfully 
with symptomatic voice therapy. 


Gilles de la Tourette’s Syndrome 


The laryngeal components of this syndrome have been 
reduced and even eliminated with the administration of the 
drug haloperidol. The value of voice therapy has not been 
assessed. 


Apraxia of Phonation 


Muteness or aphonia as a component of apraxia of speech can 
be successfully treated by working from primitive, reflexive 
coughing and throat clearing into humming, vowel produc- 
tion, and, ultimately, word formation, the latter being depend- 
ent upon the severity of the apraxic articulatory component. 


Voice Disorders Due to Mass Vocal Fold Lesions 
Hypothyroidism 


The hoarse, low-pitched voice of hypothyroidism improves 
with the administration of thyroid replacement drugs 
administered for treatment of the total illness. The pitch of 
the voice increases and the severity of hoarseness decreases 
as the patient’s thyroxin level approaches normal. 


Neoplasms 


No rational disorder-specific therapy exists for benign 
lesions, and therapy depends upon individual circumstances. 
With this group of patients more than any other, trial-and- 
error therapy is the order of the day. Different pitch and 
loudness levels, head positions, and varieties of glottal attack 
all need to be tried to determine which modes of phonation 
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yield the most audible and aesthetically acceptable voice. 
Clinician and patient need to be realistic about accepting 
less-than-desired results. 

In patients who possess only one vocal fold because of 
removal of the other for carcinoma, and in others with severe 
atrophy, scarring, or other lesions of the vocal folds, ventric- 
ular phonation may yield a better voice than voice produc- 
tion with the true vocal fold. 

A serious problem is the child who is unable to phonate 
because of chronic laryngeal obstruction due to congenital or 
acquired laryngeal stenosis, papilloma, trauma, or subglottal 
stenosis. A substitute form of communication is mandatory. 
Children that have had glottic stenoses for long periods, pre- 
venting conventional phonation, often spontaneously 
acquire pharyngeal and buccal speech, gestures, and abnor- 
mal breathing patterns. Kaslon, Grabo, and Ruben (1978) 
advocate alternative approaches to communication, such as 
sign language, an artificial larynx, communication boards, 
and esophageal speech. The need for some form of substitute 
laryngeal sound is especially critical in the very young who 
are undergoing normal language development, for failure to 
provide substitute voice will almost assuredly result in 
delayed speech and language. 


Professional Singer and Actor Voice Users 


Singers and actors with injured voice present special chal- 
lenges to voice clinicians. By virtue of their profession, they 
are required to extend vocal ranges beyond normal use often 
causing tissue damage in the process. Treatment of singers is 
discussed in a separate chapter. There is not a separate chap- 
ter for the actor because most of the techniques used with the 
actor who has an injured voice are identical to those used with 
other patients with a couple of exceptions: they may need to 
be taught to yell and use unusual voice patterns without caus- 
ing damage to the tissue; and they may have greater need to 
work on voice and physical movements together, such as 
might be necessary in projecting a voice on stage during a 
sword battle. Actors also may have additional problems 
related to vocal hygiene that need to be addressed because of 
working in dusty environments or outdoors with needs to 
greatly project their voices; eating late at night after perform- 
ances; or needing to have second itinerate jobs. This suggests 
that speech-language pathologists working with this special 
population, and in some cases working with their voice 
coaches as well, need to know a lot about anatomy and physi- 
ology and how various sounds are produced. Clinicians work- 
ing with actors also need knowledge of the techniques that 
are used in theater voice training so that they can work better 
with voice coaches and directors to help modify instruction 
and performance as needed to produce a healthy voice. 
Among the numerous theater voice training luminaries, three 
names that stand out are Alexander, Lessac, and Linklater. The 
Alexander Technique addresses habits of posture, movement, 
and relaxation to eliminate inappropriate tension contribut- 
ing to vocal problems (Conable, 1992). Lessac, as described by 
Verdolini (1998) “. . . focuses on increasing awareness of 
potentially numbing habit, then changing them to habitual 
awareness and active relaxation. Lessac focuses on three 
actions: structural action, tonal action and consonant action. 
Structural action relates to lengthening the vocal tract and 


releasing tension from the mouth and face to promote easy, 
free voice. Tonal action focuses on the physical sensations of 
vibrations in the bones of the palate and skull, creating sound 
that projects effortlessly and without the likelihood of vocal 
injury. Consonant action assigns a musical instrument, along 
with its vibratory characteristics, to each consonant sound. 
The speaker explores speech sound in such a light, and then 
brings the fruits of this exploration to the act of communica- 
tion. Used together in various concentrations, the three 
actions can lead a speaker to a wide vocal variety, creativity 
and flexibility.” Similarly, Linklater’s work on breath and head 
and neck awareness center around releasing unwanted ten- 
sion to free the voice (Linklater, 1992). Whereas voice coaches 
using these techniques often take years of coursework to learn 
the intricacies of working with actors to bring about the best 
voice, others with less instruction may also use the technique 
and benefit from working with speech clinicians who have 
had more background in behaviors that may result in vocal 
abuse. Adaptations of these techniques also have been used 
directly by speech clinicians working with voice disorders. 


Transgender Therapy 


The majority of individuals who go through gender reassign- 
ment procedures are males changing to females (ratio is 4:1). 
Neither hormones nor genital surgery bring about a feminine 
voice (Kaira, 1977). Male to female transsexuals believe their 
voice to be the greatest obstacle to social integration and to 
succeed changing their identity on completion of their sexual 
reassignment (Berger, 1988; Neuman, Welzel, Gonnermann, 
et al, 2002). Therefore, voice treatment is of paramount 
importance to this population. Voice treatment for transgen- 
der individuals may involve surgery, behavioral modification 
of communication, or a combination of the two. 

If the fundamental frequency of an individual falls below 
180 Hz, the speaker is likely to be identified as a male. If the 
speaker appears to be a female but sounds like a male, the 
voice will call attention to itself. Similarly, if a person’s fun- 
damental frequency is above 195 Hz, the individual most 
often will be identified as a female, particularly if it is accom- 
panied by breathiness. The frequency can be surgically 
raised by either reducing the size of the vocal folds or 
increasing the tension. Tension is increased with cricothy- 
roid approximation. Reduction in the size of the vocal fold is 
accomplished by creating an anterior glottic web to reduce 
the length of the vocal folds. In isolation, surgery does not 
necessarily result in a feminine voice because the overall 
mass of the vocal fold contact area may not be sufficiently 
changed. Thus, there is more to feminization of voice than 
changing the fundamental frequency. 

Feminization of speaking generally involves a multiprong 
approach (education, identifying and extending best pitch 
range, increasing breathiness, inflectional pattern work, artic- 
ulatory pacing, adjusting language content, and feedback). 
Education begins with a discussion of vocal fold vibration and 
demonstrations from movies of individuals who have believ- 
ably portrayed different gender roles without changing their 
fundamental frequency. For example, Dustin Hoffman in the 
role of Tootsie and Robin Williams in the role of Mrs. Doubtfire 
both accomplish this feat successfully. Improving range is 
often accomplished by a combination of daily practice of 
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singing and using lip trills up and down the pitch range. 
Emphasis is placed on achieving a relaxed vocal tract with a 
wide range of naturally sounding pitches rather than the high- 
est that can be attained, as the higher falsetto levels often 
sound artificial and with other changes in communication 
often are not necessary. The goal is to obtain good resonance 
balance across a range of pitches that do not sound like a male 
imitating a female. Increasing the breathy quality generally 
increases the feminization of the voice. This can be done by 
substituting soft attacks for hard glottal attacks and request- 
ing that the speaker use a softer voice. Also helpful in increas- 
ing breathy quality is the use of feedback of either visually 
monitoring increased opening and change of laryngeal pos- 
ture or providing visual feedback of airflow. Articulatory pac- 
ing focuses on slowing down speech production. Women gen- 
erally enunciate more clearly and speak more slowly than do 
men. Memorizing lines from movies and shadow practicing 
with favorite female stars provides good practice material for 
this prong of the treatment. Less obvious to many speakers is 
the need to work on feminization of content (Hooper, 1985). 
This may include work on modifying body posture and hand 
gesture, vocabulary and sentence structure, pragmatics, and 
dress. In her summary of a voice therapy case of a transsexual, 
Kalra (1997) summarizes the eleven suggestions for feminiza- 
tion of the voice developed by transsexuals and first published 
by the Looking Glass Society (1997). These self-help suggestions 
are: (1) sing; (2) raise the position of the laryngeal cartilage; 
(3) partially open the glottis when speaking; (4) emphasize 
pitch, not volume; (5) speak slowly; (6) pace speech care- 
fully; (7) use appropriate content; (8) pay attention to 
tongue position (9); hold mouth in a slight smile with 
rounder lips and good lip movement; (10) develop head res- 
onance by opening mouth more and pushing voice into the 
head; and, (11) use feedback from recording samples. Elabo- 
ration of these suggestions are found on the Looking Glass 
Society Web site. Finally, reviewing video-recorded samples 
of the speaker while reading, conversing with the clinician, 
and practicing new speech patterns over the telephone pro- 
vides an opportunity to further shape and feminize commu- 
nication skills. 


¢ Summary 


¢ The objective of voice therapy is to return the voice to a level 
of social and occupational adequacy within the patient’s 
anatomic, physiologic, and psychologic capabilities. 


¢ Voice therapy revolves around physiologic principles 
designed to restore normal biomechanics to the vocal 
apparatus. In attempting to restore voice, underlying 
pathology may be revealed as voice problems are often 
multifactorial. Restoration of the mechanical properties 
are only part of the treatment. Therapy is also tailored to 
the voice and personality of the patient and pays atten- 
tion to emotional as well as the mechanical and acoustic 
aspects of abnormal voice. 


e General principles of voice therapy are blended in with 
the principles and techniques devised for specific voice 
disorders. 


Voice therapy for excess laryngeal musculoskeletal ten- 
sion is based on the principle of muscle tension reduction 
and is accomplished by kneading the laryngeal muscula- 
ture, reducing the elevated position of the larynx, encour- 
aging general relaxation, and alleviating environmental 
stress. 


+ Voice therapy for vocal abuse is based on the principle of 
patient reeducation to use the voice more judiciously in 
everyday life and to reduce those stress factors that drive 
the patient into patterns of vocal misuse. These principles 
are implemented by increasing patient awareness of 
abnormal modes of phonation, teaching modified voice 
use, and discussing sources of emotional tension. 


+ Conversion aphonia and dysphonia are treated according 
to the principle of reestablishment of the patient’s con- 
scious awareness of greater phonatory capability and dis- 
cussing the emotional conflicts that have generated the 
voice disorder, continuously monitoring the possibility of 
referral for deeper psychotherapy. These principles are 
implemented by musculoskeletal tension reduction, evo- 
cation of increased glottal activity, and encouragement of 
patient awareness and disclosure of interpersonal con- 
flicts and life stress. 


+ Therapy for mutational falsetto is based on learning to 
inhale with increased air volume and to exhale with 
increased force simultaneous with a sharp glottal attack. 
Laryngeal re-posturing and speaking with increased oral- 
ity and loudness help facilitate a lower fundamental 
frequency. 


+ Voice therapy for adductor spasmotic dysphonia has lim- 
ited value. Phonation with a breathy vocal attack and at 
elevated pitch can yield reduced laryngospasm under less 
socially pressured speaking conditions. Botulinum toxin 
injections reduce symptoms and provide temporary relief 
but necessitate repeated injections. When combined with 
voice therapy, the length between treatments is extended. 
Laryngoplasties have been used with varying degrees of 
success. Recurrent laryngeal nerve procedures also aid in 
alleviation of this disorder, but their long-term value is 
unpredictable. 


+ Therapy for unilateral vocal fold paralysis is based on 
compensatory glottic closure by the intact vocal fold. 
Therapy for adductor bilateral paralysis is based on resid- 
ual adductor vocal fold ability. Both are implemented by 
effort closure exercises through phonation with a sharp 
glottal attack during lifting, pushing, and grunting activi- 
ties. Glottic insufficiency due to vocal fold bowing and 
disuse are treated similarly. Vocal function exercises and 
Lee Silverman Voice Therapy also appear to improve clo- 
sure and function in these populations. 


Several voice disorders resulting from central nervous 
system disease and vocal fold mass lesions are minimally 
responsive to voice therapy; further research is necessary. 


+ Voice therapy literature that is evidence-based is lacking. 
Clinicians must be vigilant in their search for verified effi- 
cacious treatments. Treatment efficacy is determined by 
clinical, functional, financial, social, and patient-defined 
outcomes. 
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Chapter 12 


Special Considerations for the Professional Singer 


Brian E. Petty 


The term professional voice users is used to denote the seg- 
ment of the population for whom the voice is used as a pri- 
mary tool of their occupation. In the United States, this term 
applies to a staggering 25 to 35% of the national workforce 
and includes (but is not limited to) such disparate profes- 
sions as teachers, clergy, radio and television broadcasters, 
salespeople, politicians, aerobics instructors, auctioneers, 
cheerleaders, actors, and attorneys (Titze, Lemke, and Mon- 
tequin, 1997; Wingate, Brown, Shrivastav, Darenport, and 
Sapienza et al., 2007). Professional singers are a highly visi- 
ble and specialized subset of individuals within the popula- 
tion of professional voice users and often present an 
unusual challenge for physicians and speech-language 
pathologists charged with their care. Although differences 
exist regarding the specific definition of the term profes- 
sional singer (VanEaton, 1990), for the purposes of this chapter 
the term will be defined liberally as singers who routinely 
receive payment for their performances. This chapter will 
discuss the occupational requirements faced by professional 
singers, identify specific concerns to be addressed during 
assessment, and make recommendations regarding treat- 
ment and continuing education. Although some of the infor- 
mation at first glance may appear redundant to information 
provided in other chapters, a closer look reveals the adapta- 
tion and elaborations needed to properly assess and treat 
this special population. Furthermore, whereas professional 
actors have very specific diagnostic and treatment concerns 
related to their profession, some portions of this discussion 
about professional singers may be extrapolated to the acting 
population as well. 

The functional demands that singers place on the laryn- 
geal mechanism in terms of frequency range, amplitude 
control, acoustic variation, and overall vocal stamina are 
significant and unique. Because these specific demands 
vary between singers of various musical traditions and abil- 
ities, the risk of vocal impairment or injury is also variable 
depending on such factors as preferred musical style and 


performance practices, history of vocal study, general 
health status, and other influences affecting vocal hygiene 
(Koufman, Radomski, Joharji, et al, 1996). Singers may be 
significantly impaired by subtle laryngeal changes that may 
be considered to be within the normal range of variability 
within the general population. Conversely, the well-trained 
singer will also possess a high level of neuromuscular 
laryngeal control that may lessen (or even eliminate) audi- 
ble symptoms of some laryngeal pathologies (Elias, Sat- 
aloff, Rosen, et al, 1997; Lundy, Casiano, Sullivan, et al, 
1999). Therefore, it is imperative that singers receive expe- 
rienced multidisciplinary medical care from practitioners 
that are knowledgeable of the functional vocal require- 
ments of singers and sensitive to the particular emotional, 
physical, financial, and professional concerns that singers 
may experience as a result of vocal impairment (Sataloff, 
2005). 

A comprehensive voice evaluation for a professional 
singer should include a carefully collected and thorough 
case history (including medical, behavioral, and social infor- 
mation), acoustic and aerodynamic data as appropriate, and 
visualization of the larynx using rigid and/or flexible 
fiberoptic endoscopy under halogen and stroboscopic xenon 
light sources (high-speed laryngoscopy can often be a useful 
tool if this technology is available). The team should com- 
prise qualified speech-language pathologists, singing teach- 
ers, and otolaryngologists who are experienced in treating 
professional singers. Many professional singers who per- 
form in metropolitan areas prefer to live and raise their 
families in smaller communities where access to such spe- 
cialized care is more limited. Because the concerns of the 
professional singer are unusual and highly specific to their 
special vocal demands, it is important that they be seen by 
health care providers who have experience in treating this 
population. If these specialty services are available within a 
reasonable travel distance, it is well worth the expense and 
effort. 
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+ Gathering a Comprehensive 
Case History 


When a singer’s voice begins to exhibit symptoms of dys- 
phonia, other singers and colleagues are often the first peo- 
ple to notice the problem and offer advice. If the larynx were 
like a violin, something that the musician could see, touch, 
and repair, there might not be as much mythology and lore 
associated with how to take care of it. The traditional meth- 
ods of singing instruction did not emphasize a comprehen- 
sive understanding of laryngeal anatomy and physiology, 
though in recent years this has become a more widely 
addressed component of academic music programs. Many 
remedies that are considered traditional among professional 
singers have been either proved to be ineffective (Roy, Tan- 
ner, Gray, et al, 2003) or have been shown to have adverse 
effects. It is therefore important to take a thorough case his- 
tory, including a detailed description of the remedies that 


Table 12.1 Summary of Basic Musical Terms 


the patient has tried prior to the appointment. Samples of a 
comprehensive case history form for singers have been pub- 
lished previously (Sataloff, 2000) and may serve as an excel- 
lent template for data collection. 

One of the first things that a health care provider must do 
before beginning to treat professional singers is to become 
familiar with some of the terms that singers use in profes- 
sional conversation that may not be familiar to nonmusi- 
cians. This may seem trivial at first, but displaying poor 
knowledge of the singer’s professional parlance may make it 
more difficult for the patient to development a feeling of 
trust and confidence. Michael Trudeau, Ph.D., CCC-SLP, once 
told a story of a singer who presented to a voice clinic stating 
that “my voice lacks brilliance,” a statement that yielded 
blank stares from the physician and speech pathologist in 
attendance. Some simple familiarity with basic terminology 
will establish rapport and confidence. To that end, Table 12.1 
is asummary of basic musical terms, some of which are con- 
tained in this chapter. 














Appoggio The dynamic balance between the inspiratory and expiratory forces in singing. 

Aria An elaborate composition for solo voice with instrumental accompaniment. In opera, the aria represents lyric episodes 
that temporarily relieve the dramatic tension of the action. 

Arpeggio The notes of a chord played one after another instead of simultaneously. 

Chest voice The vocal register associated with connected speech, or “modal register.” 

Coloratura Arapid passage, run, trill, or similar virtuoso-like material, particularly in vocal melodies of 18th and 19th century operatic 
arias. 

Crescendo Gradual increase in loudness. 

Decrescendo Gradual decrease in loudness. 

Dress rehearsal The rehearsal immediately preceding the first performance of a theatrical production. 

Fioritura Melodic embellishment, either written out or improvised. See ornamentation. 

Forte Loud. Notated in musical scores as “f” 

Fortissimo Very loud. Notated in musical scores as “ff.” 

Green room A backstage lounge or dressing room used by performers. 

Head voice The upper singing range, which may or may not be used synonymously with falsetto. 

Legato Singing (or playing) without any perceptible interruption between the notes. 

Melisma Aterm originating in medieval Gregorian chant but now used to refer to an expressive vocal passage sung to one syllable, 
as opposed to the virtuoso-like and frequently stereotyped coloratura. 

Messa di voce Aspecial vocal technique of 18th century music, consisting of a gradual crescendo and decrescendo over a sustained tone. 
Modern singers use it extensively for training but sparingly in performance. 

Mezza voce “Half voice,” or singing with restrained volume of tone. 

Mezzo forte Medium-loud. Notated in musical scores as “mf.” 

Mezzo piano Medium-soft. Notated in musical scores as “mp.” 

Opera A combination of solo vocal, choral, orchestral, and theatrical performance involving music, drama, poetry, acting, dance, 
stage design, and costuming. 

Oratorio A composition for solo voice, chorus, and orchestra or of religious or contemplative character that is performed in a 
concert hall or church without scenery, costumes, or action. 

Passaggio Italian word for “passage,” used in reference to the transition between vocal registers. 

Piano Soft. Notated in musical scores as “p.” 

Pianissimo Very soft. Notated in musical scores as “pp.” 

Ring The perceptual term used in reference to the singer’s formant. 


Singer’s formant 
Staccato 
Tech rehearsal 


Vibrato 


Astrong band of spectral energy found between 2500 and 3300 Hz, associated with a resonant, “well-placed” singing 
voice. 

A manner of performance indicated by a dot (or, less frequently, the sign) placed over the note, calling for a 50% 
reduction of its written duration. 

The penultimate rehearsal, not usually costumed, during which the final technical aspects of the performance are 
finalized. 

Ascarcely noticeable wavering of the pitch during singing. 


Source: Apel, W., Harvard Dictionary of Music. End edition, revised and enlarged. Eighth printing (1974). Cambridge, MA: Belknap Press of Harvard 
University Press. Permission pending. 
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Subtlety of Symptoms 


Because professional singers may be more attuned to subtle 
changes in their own vocal quality compared with non- 
singers, they will often present to the clinic reporting symp- 
toms that are not initially audible in connected speech tasks. 
Common concerns among professional singers (and amateur 
singers as well) usually involve issues related to vocal fold 
closure, pliability, and/or symmetry. The symptoms associ- 
ated with these areas include difficulty with vocal projection 
(including vocal fatigue), increased vocal effort during quiet 
singing, changes to the patient’s pitch range, difficulty sus- 
taining longer musical phrases, and/or altered perceptual 
vocal quality (Dailey, 2005). For the classical or operatic 
singer, instability in the transition between the modal or 
“chest” voice and the upper range or “head” voice may be 
described, resulting in pitch breaks, quality disruption, 
and/or decreased ability to project the voice in that range. 
The production and manipulation of the notes within the 
transition register (also called the passaggio, or “passage” in 
Italian) is one of the more challenging functional goals faced 
by the singer during formal singing training. When the 
laryngeal instrument is working optimally and if the singer’s 
technical expertise is well established, the result is perceived 
as an even, consistent vocal quality throughout the singer’s 
range, without audible transition from chest to head regis- 
ters. Difficulty negotiating the passaggio is often one of the 
first symptoms reported by singers during the onset of early 
dysphonia. 


Voice Use Patterns Not Related to Performance 


The standard case history questions regarding symptoms, 
time and type of onset, progression of the symptoms since 
onset, and activities or other factors that influence the symp- 
toms will most certainly apply to the population of profes- 
sional singers. However, it is important to obtain more 
specific data to understand fully the nature, etiology, and 
effect of the dysphonia. One of the first issues to address 
would be the patient’s overall voice use behaviors, during 
singing engagements as well as nonmusical activities. Many 
voice problems in singers are the result of nonmusical voice 
misuse or tissue changes related to other medical issues. 
Does the patient have another job to supplement his or her 
music-related income? If so, does that position require a sig- 
nificant amount of voice use (such as sales positions, call 
center representatives, etc.)? Does this additional job take 
place in an environment that would be irritating to the larynx 
(such as smoky bars, restaurants, construction sites, etc.)? 
Many professional singers teach singing themselves, plac- 
ing yet more demands on the laryngeal mechanism. If the 
patient is a singing teacher as well, how many students will 
he or she teach in one day? Does the patient often demon- 
strate for the students by singing him- or herself? How is the 
studio (teaching space) arranged? Is it ergonomically sound, 
or is the patient frequently positioned awkwardly in an 
effort to see the student and play the piano simultaneously? 
What is the patient’s social voice use like? Would the patient 
describe self- as “extremely talkative”? Does he or she they 
go to entertainment venues that require speaking in noisy 


environments (such as bars, clubs, sporting events, etc.)? 
Does the singer frequently use “character voices” when read- 
ing to children, speak for extended periods on the telephone, 
or frequently raise the voice in anger? Asking multiple open- 
ended questions regarding voice use patterns outside of the 
performance venue will provide much more insight into the 
patient’s overall voice use profile and may illuminate signif- 
icant etiologic factors. 


Medications 


Part of any comprehensive case history involves obtaining a 
list of the patient’s current medication regimen. Many med- 
ications produce a drying effect on the vocal folds, which 
results in increased phonation threshold pressures and a 
sensation of increased vocal effort. Medications with drying 
effects include antihistamines, some acne medications (such 
as Accutane® [isoretinoin] Roche Pharmaceuticals, Basel, 
Switzerland), 8-blockers, diuretics, diet pills, and high doses 
of vitamin C. Consultation with the prescribing physician 
often results in a change to an alternative medication with 
milder drying effects (Sandage and Emerich, 2002). 

Environmental and seasonal allergies are extremely com- 
mon among the general population and can be diagnosed at 
any time during the lifespan. Touring singers may be 
exposed to novel allergens and subsequently experience 
allergy symptoms that are geographically specific. Although 
many allergy medications have a drying effect on the vocal 
folds, it is important to ensure that the airflow through the 
nose and sinuses is relatively unhindered to help facilitate 
adequately balanced oral-nasal resonance. Thus it is crucial 
to adhere to the allergist’s recommendations regarding con- 
sistent allergy maintenance, even if the patient is initially 
unenthused about taking medication. 

Further environmental factors that may affect the per- 
forming artist include substances to which performers are 
exposed in the workplace. If the patient’s onset of voice dif- 
ficulties coincided with the start of rehearsals or perform- 
ance for a particular production, he or she should be asked 
questions regarding the types of sets and props used. For 
example, Richter and colleagues showed that singers, actors, 
and other performing artists are occasionally exposed to 
ambient stage conditions that contain irritating agents such 
as aromatic diisocyanates used to create a mound as part of 
the production’s set, spore-producing fungi contained in 
cork granulate, cobalt and aluminum used in paint pigments, 
and fine quartz sand capable of entering the alveolae 
(Richter, Löhle, Knapp, et al, 2002). Recent investigations 
into the harmful consequences of some theatrical special 
effects have resulted in restrictions being placed by perform- 
ing artists’ unions (such as Actors Equity) on the use of such 
effects as glycol-based theatrical smoke (Pendergrass, 1999; 
Rossol and Hinkamp, 2001; Teschke, Chow, van Netten, et al, 
2005). 

Inhaled medications, often used for treatment of asthma, 
can cause edema and erythema of the vocal folds, resulting 
in increased stiffness of the tissues and a subsequent 
increase in perceived vocal effort (DelGaudio, 2002). The 
proper diagnosis of asthma in a singer is necessary to avoid 
unnecessary medications. To optimize pulmonary support 
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for the singing voice in the presence of this condition, collab- 
oration with the pulmonologist is essential in helping the 
patient to identify the best medication and the minimal 
effective dose to minimize these effects. 

The larynx is a gracefully balanced instrument that is 
markedly sensitive to endocrine fluctuations in the body. 
Some female singers report vocal quality changes just prior 
to starting their menstrual cycle (Davis and Davis, 1993), 
including thickened laryngeal secretions with subsequent 
throat clearing, vocal fold edema and stiffness, and increased 
vocal fatigue. Abitbol and colleagues showed similar thick- 
enings of the endometrial and laryngeal mucosa that are 
much more pronounced during the premenstrual period, 
particularly during high-range singing tasks. Accompanying 
these changes are an increased viscosity and acidity level of 
the laryngeal secretions, laryngeal edema, and exacerbation 
of microvarices, resulting in perceptual reports of vocal 
fatigue, a truncated upper range, decreased ability to pro- 
duce pianissimo notes, effortful vocal projection, and 
acoustic changes resulting in a “metallic” or “husky” vocal 
quality (Abitbol, Abitbol, and Abitbol, 1999). Singers may 
also report voice changes related to menopause (Abitbol and 
Abitbol, 1998), including a decreased upper range, changes 
to modal range quality, or decreased control of the vibrato 
(the gentle variation of pitch during sustained sung tones 
that many classical singers consider to be an inherent by- 
product of effective singing technique). Under the direction 
of an experienced endocrinologist, hormone replacement 
therapy may be indicated for restoring the characteristic ring 
(or singer’s formant) to the classically trained female singing 
voice after menopause, depending on the patient’s age, fam- 
ily history, and other risk factors (Abitbol, Abitbol, and Abit- 
bol, 1999). 

Patients with acute laryngitis occasionally present with a 
request for a prescription from the otolaryngologist for a 
corticosteroid such as methylprednisolone (Medrol® [methyl- 
prednisolone] Pfizer, Inc., New York, NY). Whereas corticos- 
teroids can be quite useful in obtaining prompt suppression 
of laryngeal inflammation, they should always be considered 
carefully. Corticosteroids can interact with some medica- 
tions such as antibiotics and oral contraceptives (Davis and 
Davis, 1993), producing an increased risk for side effects, 
including salt and water retention and decreased glucose 
tolerance. Increased vascular fragility from corticosteroid 
use can place some singers at increased risk for vascular 
injury, including hemorrhage. Discussions regarding the 
patient’s voice type (or fach if they are a classical singer) and 
specific performance demands will help to illuminate the 
patient’s likelihood of developing deleterious consequences 
from corticosteroid use. In all cases, the duration of the cor- 
ticosteroid course should be as short as possible. The patient 
who presents with insistent requests for corticosteroids 
should be referred to the otolaryngologist to discuss the seri- 
ous side effects of this powerful class of drugs. The decision 
to prescribe or not prescribe corticosteroids should be made 
after an in-depth cost-benefit analysis weighing the risks of 
the medication against the possible professional or financial 
impact of cancellation or substandard performance. 

The Web site maintained by the National Center for Voice 
and Speech (www.ncvs.org) offers concise summaries of 200 


commonly prescribed medications as well as herbal supple- 
ments and their effects on the voice. 


Laryngopharyngeal Reflux Symptoms 


Gastroesophageal reflux disease (GERD) is one of the most 
common ailments in the general population (Isolauri and 
Laippala, 1995) and is a familiar term to many patients. It is 
important, however, to emphasize the distinction between 
gastroesophageal and laryngopharyngeal reflux (LPR). Kouf- 
man discerned that complaints of throat clearing were sig- 
nificantly more common and complaints of heartburn were 
significantly less common among patients with LPR com- 
pared with patients with GERD (Koufman, 1991). Although 
laryngeal irritation secondary to LPR is common in at least 
50% of patients presenting for evaluation of hoarseness 
(Koufman, Amin, and Panetta, 2000), the effects of this irri- 
tation will be more significant for professional singers due to 
the nature of the demands they place on the laryngeal mech- 
anism. Although many patients will deny LPR because of the 
absence of overt heartburn, careful questioning will often 
reveal several more subtle symptoms consistent with this 
condition. These symptoms include (but are not limited to) 
heartburn, hoarseness, dry mouth or throat (despite ade- 
quate systemic hydration), chronic dry cough or throat clear- 
ing, “morning” voice, nocturnal coughing, persistent and/or 
thick phlegm in the throat, prolonged warm-up time (espe- 
cially in the morning), a burning sensation in the throat, a 
sensation of postnasal drip, occasional regurgitation (some- 
times informally described as “hot burps” or “mini throw- 
ups”), globus pharyngeus (a “lump” sensation in the throat), 
and indigestion or “sour stomach.” 

Gastric reflux is closely associated with athletics and 
other intense physical activity and is often linked to the 
length and intensity of the activity (Jozkow, Wasko-Czop- 
nik, Medras, et al, 2006). Placing aside the stereotypical 
images of the rotund opera singer, it is not surprising to note 
that professional singers and athletes are similar in this 
respect. Singers are, by their very nature, more susceptible 
to LPR because of the prolonged intraabdominal pressures 
associated with singing (Cammarota, Elia, Cianci, et al, 2003). 
Furthermore, the lifestyle of the traveling performer puts 
them at an even higher risk because of the combination of 
strenuous, prolonged physical exertion during performances 
followed by relatively late-night meals of often-dubious 
nutritional value. Therefore, a thorough understanding of 
the behavioral, dietary, and environmental risk factors asso- 
ciated with LPR is critical to the process of gathering a com- 
prehensive case history. The dietary factors credited with 
exacerbation of reflux include fried foods, spicy foods, red 
wine, citrus fruits and juices (including pre-performance 
lemon juice), tomato-based products (including sauces, 
soups, and juices), alcohol, caffeine, carbonated beverages, 
nuts, mint/menthol, and chocolate. Behavioral and environ- 
mental exacerbation factors include late eating, obesity, 
tight clothes or waistbands (including control-top panty- 
hose), eating shortly before engaging in strenuous exercise 
(including performances), and smoking. Elimination or 
moderation of these factors has been shown to be effective 
in ameliorating symptoms of LPR, independently from 
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medical or surgical intervention (Steward, Wilson, Kelly, 
et al, 2004). 


Emotional Concerns 


When discussing the patient’s dysphonia, it is important to 
understand that there is often a significant connection 
between the singer’s voice and his or her self-concept. 
Singers, especially those who are also singing teachers, will 
often express feelings of self-blame when they present to the 
voice clinic, with the assumption that they “should have 
known better” or have become “lazy with technique.” These 
self-deprecating feelings are often alleviated by the results of 
the examination, but it is important to acknowledge them 
and reassure the patient that occasional feelings of guilt are 
normal sensations among their peers. 

Whereas hypochondriasis is rare in professional singers, 
depression and anxiety are common diagnoses in up to 50% 
of the general population (Kessler, McGonagle, Zhao, et al, 
1994). And although professional singers are indeed a spe- 
cialized population, they are most certainly affected by psy- 
chosocial stressors that have an impact on members of the 
general population, including anxiety and stress related to 
social and romantic relationships, financial planning, job 
security, child welfare, and so forth. It is important to adhere 
to a holistic patient care paradigm by ensuring adequate 
consideration of psychosocial factors when indicated. Pro- 
fessional singers and students of singing, it must be realized, 
are just as susceptible to laryngeal dysfunction associated 
with psychogenic factors as is the nonsinging population. 
Therefore, as part of case history taking, the speech-language 
pathologist absolutely must administer a thorough, probing 
psychodiagnostic history to determine the presence of inter- 
personal conflicts involving persons important in their lives, 
parents, spouses, employers, or anyone else who arouses 
feelings of anger, anxiety, or depression, producing elevated 
laryngeal-hyoid musculoskeletal tension (vocal hyperfunc- 
tion) or even conversion disorder. 

The psychosocial history is crucial to a correct diagnosis and 
appropriate therapy (see Chapters10 and 11) but is not often 
addressed because of a lack of training and reluctance to 
invade that territory of the patient’s life. However, when a 
conflict is addressed by both parties and brought out into the 
open, the singing difficulty will be much more easily resolved. 

I am reminded of a 22-year-old aspirant to an operatic 
career who found she was unable to practice at home because 
her father, a frustrated and unsuccessful singer himself, was 
so jealous of her talent that he sabotaged all her efforts to 
practice at home. The patient subsequently developed an 
inability to sing at her full pitch range. When first asked about 
emotional obstacles in her life, she denied vehemently any 
such causal factors, but after repeated inquiries into this area 
she broke down and admitted her father’s involvement, hav- 
ing been reluctant to reveal such conflict fearing disloyalty to 
him. After family therapy, her pitch control problems abated. 


Voice Study and Dysphonia History 


During the process of gathering a comprehensive case history, 
it is important to identify what kind of singing training the 


patient has had. If the patient has studied singing, then he or 
she should be prompted to report how long the study lasted, 
when it occurred, and with whom the patient studied. The 
patient who underwent formal singing training for 6 years 
with an experienced teacher at a prominent conservatory with 
cessation of training a year or two ago will have a much differ- 
ent understanding of his or her instrument than a patient who 
studied singing for 2 months with a church choir conductor 10 
years ago. Additionally, the patient should identify whether or 
not the voice study was effective in facilitating a solid singing 
technique, according to the patient’s own perception. Making 
this distinction should certainly not imply that formal training 
is necessary for one to be a successful and healthy professional 
singer, but gathering this information will assist the health 
care provider in planning any future intervention. 

If the patient is actively studying voice at the time of the 
evaluation, and if the patient is comfortable providing the 
information, it is important to obtain the name of his or her 
singing teacher and vocal coach. Traditionally, the role of the 
singing teacher is to facilitate understanding and develop- 
ment of singing technique, with special consideration given to 
dynamic range, artistic quality, and vocal endurance. The 
vocal coach is often an experienced pianist serving as a collab- 
orative artist and also instructs the singer in language/ diction, 
acting/character development, and other forms of perform- 
ance practice. Collaboration with the singing teacher and 
vocal coach ensures that the information being presented to 
the patient is consistent, helps facilitate carryover of thera- 
peutic techniques to functional singing tasks, and establishes 
a mutually beneficial interdisciplinary relationship between 
the professions. The American Speech-Language-Hearing 
Association (ASHA), in conjunction with the National Associa- 
tion of Teachers of Singing and the Voice and Speech Trainers 
Association, have created a position statement codifying the 
important roles of these skilled professionals in the process of 
voice habilitation, which should be read carefully before 
beginning the collaborative process (Sandage M (ED), Graves- 
Wright J., Heuer R., Verdolini K., Ferketic M., Westerman- 
Gregg J., Titze I., DeVore K., Raphael B., and Hooper C., 2005). 


Singing Style 


Certain vocal styles have been shown to be associated with 
higher amounts of muscle tension than others, resulting in 
an increased incidence of dysphonia. Choral music, art song, 
and opera have been linked to lower amounts of muscle ten- 
sion than have been jazz/pop, blues, bluegrass/country/ 
western, and rock/gospel (Koufman, Radomski, Joharji, et al, 
1996). It is important to note that many singers perform in 
more than one style, switching from opera to choral music to 
jazz depending on the type of venue and the needs of their 
employer. Even within those styles associated with low 
amounts of muscle tension, singers may experience chal- 
lenges when transitioning between styles. For example, 
opera singers may find it difficult to sing in choirs, reporting 
that they feel a sensation of “tightness,” “constraint,” or per- 
ceptual respiratory support changes as a result of the singer’s 
attempt to “blend in” with the other voices and not “stick 
out.” Another singer who performs in the bluegrass or gospel 
traditions may report this this professional culture 
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encourages performers to sing with hyperfunctional tech- 
nique. He or she may report that “if I’m not hoarse at the end 
of the performance, then I wasn’t really trying.” For both 
groups of these singers, it is important to approach therapeu- 
tic intervention with the understanding that treatment will 
be provided within the context of the musical style in which 
they are expected to perform. Even within styles that are con- 
sidered somewhat hazardous from a vocal misuse standpoint 
(rock, country/western, etc.), there still exists a behavioral 
margin of improvement that can decrease vocal fatigue and 
increase perceptual vocal quality. Careful application of tech- 
niques learned in behavioral voice therapy, in collaboration 
with the singing teacher (if applicable), will facilitate 
improved performance outcomes and patient comfort while 
still respecting the various stylistic singing traditions. 


Smoking 


Although not unheard of, smoking has long been taboo 
among classical singers. In contrast, it is more common 
among singers of some nonclassical styles, either during the 
break between sets or during personal time. In addition to 
gathering information regarding when the patient started 
smoking and how much he or she smokes, the health care 
provider should also ask what the patient smokes. Marijuana 
smoking is not usually revealed unless the patient is asked 
specifically. Because marijuana smoke is significantly hotter 
than cigarette smoke (depending on the delivery method), it 
is recommended that the patient (if he or she chooses to 
continue smoking) use a water-cooling device to cool the 
smoke and reduce the risk of serious thermal damage to the 
vocal folds. The patients should be offered information 
regarding the risks of smoking, as well as referrals to smok- 
ing cessation programs if they are interested in quitting. If 
the patient is touring, he or she may be interested in dis- 
cussing self-directed smoking cessation methods with the 
physician, or may wish to receive information about smok- 
ing cessation programs located in the city or town in which 
based. The patient is smoking cessation (whether tobacco or 
marijuana) is often an ongoing process that requires com- 
mitment and perseverance from the patient and support 
from the health care provider. Although we can never rec- 
ommend that a patient continue smoking, it is important to 
avoid making patients feel defensive by encouraging them to 
pursue cessation at their own pace. 


Hydration 


Maintaining adequate hydration is important for everyone, 
but is especially so for the professional singer. An inverse 
relationship exists between phonatory effort and hydration 
level, particularly for the high range (Sivasankar and Fisher, 
2002). Therefore, if the singer wants to be able to produce 
high notes with relative ease, he or she must be sure to 
maintain adequate systemic and superficial hydration. The 
National Research Council recommends total daily fluid 
intake of ~12 cups for men and 9 cups for women, in the 
form of noncaffeinated, nonalcoholic beverages, soups, and 
foods. Solid foods contribute ~4 cups of water, with an 
additional cup coming from the water of oxidation (Kleiner, 


1999). These daily hydration amounts are recommended 
for adults with average energy expenditures and environ- 
ments and must be adjusted depending on individual vari- 
ability. Because of the particular physical and environmental 
demands placed on professional singers, it may be extrapo- 
lated that they may require more oral hydration than the 
average sedentary adult (Von Duvillard, Braun, Markofski, 
et al, 2004). Maintaining an ambient humidity of ~40% may 
improve superficial hydration of the vocal folds (Lawrence, 
1981). Superficial hydration may also be enhanced by 
emphasizing nasal breathing rather than oral breathing, as 
nasal breathing warms and humidifies the air before it 
makes contact with the vocal fold mucosa. This has been 
shown to result in lower phonation threshold pressures and 
decreased perceived vocal effort during prolonged reading 
tasks (Hunter, Svec, Titze, 2006). I also concur with Mary 
Sandage, M.A., CCC-SLP, who frequently recommends that 
singers eat less-acidic “wet snacks” backstage in addition to 
drinking water. Anecdotal evidence suggests that wet- 
snack consumption results in decreased report of dry 
mouth and improved patient comfort. Wet snacks can 
include any nonacidic fruit or vegetable with high water 
content, such as apple or pear slices, peaches, grapes, 
melon, or cucumber slices. Acidic foods, such as oranges, 
grapefruits, lemons (including lemon juice), and limes, 
should be avoided because of their correlation with reflux 
exacerbation. 


Performance Practices: The Nonclassical Singer 


The demands placed on the professional singer depend upon 
the style of music sung and the type of schedule maintained. 
Nonclassical singers, such as those who singed gospel, folk, 
rock, country/western, and so forth, should describe the 
length of their sets (groups of songs), the number and type of 
musicians with whom they usually perform, the environ- 
ments in which they normally perform, and what kind of 
amplification is used. They should also describe in detail 
what they usually do between sets; if the singer has a quiet 
dressing room (or green room) in which to relax between 
sets, it stands to reason that he or she will likely have fewer 
symptoms related to vocal overuse at the end of the per- 
formance than if the singer spends the time between sets 
chatting with the band, smoking, and drinking caffeine or 
alcohol. Patients who sing with a band, often will benefit 
from simple adjustments to the monitors at the foot of the 
stage that provide feedback to the singer and band members. 
They may also benefit from in-the-ear monitors, under the 
supervision of an audiologist experienced in working with 
musicians. If the performance venue is a smoky bar or club, 
the singer will often benefit from placing small electric fans 
on the floor of the stage next to the monitors to create air- 
flow away from the singer. 


Performance Practices: The Classical Singer 


The performance demands of the classical singer are some- 
what different from those of singers of other musical styles. 
The classical singer is much less frequently amplified, rely- 
ing more heavily on optimal laryngeal function and the 
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singer’s formant (Sundberg, 2001). Though the performance 
venues for classical singers are rarely the smoky clubs 
endured by rock or jazz musicians, the opera houses and 
concert halls are usually very large and have acoustic prop- 
erties that are unique to each location. If the acoustic 
properties of the venue are suboptimal, it may result in 
“oversinging,” or hyperfunctional vocal projection due to 
decreased auditory feedback. Performance demands will 
also change as a function of the size of the orchestra with 
which the classical singer is performing. If the singer is per- 
forming an operatic role, it is important to know whether he 
or she has performed the role previously or if it is new as 
well as how much time the singer will be required to spend 
on stage for the role. Some operatic roles allow for occa- 
sional vocal recovery time during costume changes 
(Angelotti in Puccini’s Tosca is one example), whereas oth- 
ers require the singer to perform for the majority of a 3-hour 
production (The Governess in Britten’s The Turn of the Screw, 
for example). 

Some demands are common among all professional 
singers of all styles, however. How often does the singer per- 
form? Do he or she employ a consistent warm-up and cool- 
down exercise regimen? How often, where, and for how 
long does the singer rehearse? If the singer travels via plane, 
how soon after the flight will he or she be required to sing? 
If the singer is a premenopausal woman, is she often 
required to sing during her premenstrual period? Does the 
touring performer have “down time,” and if so, what does 
the singer do during that time? All of these factors play an 
important role in the overall vocal load of professional 
singers and are critical in understanding the responsibilities 
and expectations that they face. 


Business-Related Concerns 


When a singer becomes dysphonic and has to make a deci- 
sion whether or not to cancel a performance, the decision is 
usually influenced by business concerns as well as medical 
factors. Many professional singers are represented by agen- 
cies who arrange bookings for performance engagements. 
Singers who have international careers will sometimes 
employ agents in several countries who cooperate with the 
primary agent. Will the cancellation cause financial hard- 
ship for the patient as a result of not being paid? Is there a 
chance of causing difficulty with maintaining agency repre- 
sentations? Will the cancellation produce a perception of 
unreliability among local booking agents, concert promot- 
ers, orchestral conductors, or music festival managers? Will 
the cancellation result in the forfeiting of monetary 
deposits for performance venues and merchandising con- 
tracts? There are additional concerns regarding long-term 
effects of proceeding with the performance despite illness. 
How critical is this performance to the singer’s overall 
career? Will there be press coverage with subsequent per- 
formance reviews? Will the concert be recorded for later 
broadcast or recording sales? Will it be televised? These 
factors must be seriously considered when formulating a 
cost-benefit analysis of cancellation versus suboptimal per- 
formance or compensatory adjustments to repertoire or 
range. 


+ Evaluation of the Professional Singer 


Auditory-Perceptual Evaluation 


The auditory-perceptual evaluation of the professional singer 
is perhaps the most challenging portion of the examination 
process for the speech-language pathologist or otolaryngolo- 
gist who is not a singer him- or herself. The difficulty lies in 
the extraordinary auditory skills required to detect very sub- 
tle changes in timbre, pitch, and stability that in the general 
population would be considered within the normal range of 
variability. Usually, one will find that the primary complaints 
are more subtle among classical singers than among their 
colleagues who perform in other musical styles. 

First, the speech-language pathologist will want to evaluate 
the patient’s voice production during unstructured connected 
speech and sustained vowel production. This should be 
accomplished during the initial greeting and casual conversa- 
tion, as well as during the case history and initial examination 
phases. Special attention should be paid to perceptions of 
habitual pitch, speaking rate, overall loudness, roughness 
(including ventricular phonation or glottal fry), and strain. Be 
mindful of physical signs of vocal hyperfunction, including 
furrowed brow, stiff body posture, prominent musculature in 
the anterior neck, clavicular breathing patterns, clenched jaw, 
or raised shoulders. Does the patient have adequate prosody? 
Is the patient’s articulatory accuracy within functional limits 
based on informal perceptual evaluation? When the patient 
repeatedly produces sustained vowels, do the voice sound sta- 
ble in terms of pitch, quality, and loudness? If not, then the 
patient may benefit from a more in-depth clinical oral mech- 
anism and motor speech examination to illuminate or rule out 
a possible neurologic dysfunction. 

Next, attention should be paid to the voice produced dur- 
ing singing tasks. The patient will likely be able to sing a sec- 
tion of a particular piece that demonstrates the problem at 
hand. He or she may have also brought a recording to demon- 
strate the singer’s typical baseline quality. If the patient can- 
not adequately demonstrate the voice difficulty, he or she 
should be asked to produce vocal exercises that will illumi- 
nate various aspects of the current vocal quality. Pitch range 
can be examined using simple scales or slow glides. Control 
of loudness may be demonstrated using messa di voce exer- 
cises, using a single breath and a single pitch. This exercise 
starts with quiet phonation (piano) and gradually gets louder 
(crescendo) until reaching a loud phonation (forte), then grad- 
ually gets softer (decrescendo) until the patient is singing qui- 
etly again. The crescendo and decrescendo should be well 
controlled and gradual, and the exercise should be demon- 
strated on pitches representing the entire pitch range. Voice 
onset can be evaluated using arpeggios on /h/-initial sounds, 
with short staccato production. It is often useful to include all 
five of the Italian cardinal vowels, /i, e, a, o, u/ when complet- 
ing multiple repetitions of this exercise, as often singers will 
report more acute difficulty with certain vowel sounds over 
others. Throughout the singing examination, listen to ensure 
that the perceptual vocal quality is consistent throughout the 
range and that the vibrato (when appropriate) sounds well- 
controlled. Be sure to examine the patient’s physical presen- 
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tation during this process, ensuring that he or she is perform- 
ing the exercises without obvious extraneous tension. 


Traditional Classification System for Operatic 
and Classical Singers 


For classical singers, it is important to note their fach, or 
“voice type.” The fach system is the traditional German clas- 
sification system for classical and operatic singing voices, 
providing a convenient method for ensuring that singers are 
not asked to sing operatic roles for which their voice is not 
suited. The system is used more casually in the United States 
opera houses than in Europe. There are four primary voice 


types: 


soprano (the higher female voice) 
mezzo-soprano or alto (the lower female voice) 


+ 
+ 
+ tenor (the higher male voice) 
+ 


baritone or bass (the lower male voice) 


Although the range-specific description of these voice 
types is somewhat simplistic (the actual pitch ranges of the 
voice types tend to overlap, requiring classification based on 
perceived vocal quality in addition to pitch range), it illus- 
trates their most basic differences. Choral singers will often 
characterize their voices in this way, without additional clas- 
sification schemes. 

Operatic singers, particularly those who maintain inter- 
national careers, will often use more specific terminology to 
describe their voice type. It is important to note that mod- 
ern operatic singers often do not adhere to the confines of 
the formal fach system as strictly as the artists of past gen- 
erations, but the terminology is nevertheless still in wide 
use. An abridged summary of the more common terms fol- 
lows: 


Coloratura soprano 


+ High, light, agile soprano voice 


+ Must be able to sing very fast and complex progres- 
sions of notes 


Examples: Beverly Sills, Edita Gruberova 


+ Regnava nel silencio, from Donizetti’s Lucia di Lammer- 
moor 


+ O luce di quest’ anima from Donizetti’s Linda di Chamounix 


Dramatic coloratura soprano 
+ “Darker, heavier” sound 


Must still be able to produce very fast, high, complex 
note progressions 


+ Examples: Joan Sutherland, Maria Callas 


+ Der Hölle rache kocht in meinem Herzen, from Mozart’s 
Die Zauberflöte 


+ Casta Diva from Bellini’s Norma 


Soubrette soprano 


+ Light, young-sounding voice, with the best quality in 
the middle range 


No extensive coloratura progressions 
Examples: Kathleen Battle, Dawn Upshaw 
Batti, batti, o bel Masetto from Mozart’s Don Giovanni 


Mein Herr Marquis from Strauss’ Die Fledermaus 


Lyric soprano 
+ Most common female singing voice 
Light, airy voice with equal quality throughout range 
Examples: Renee Fleming, Kiri Te Kanawa 
Signore, ascolta! from Puccini’s Turandot 


Donde lieta usci al tuo grido d'amore from Puccini's La 
Boheme 


Spinto soprano 


“Spinto” means “squeezed” or “throttled” in Italian, 
and refers to a “richness” or “depth” in perceptual 
vocal quality 


+ Same pitch range as the lyric soprano, but “darker” 
color and more powerful loudness range (especially 
for high notes); can often sing a mixture of lyric and 
dramatic roles 


+ Examples: Leontyne Price, Jessye Norman 
+ Es gibt ein Reich, from Strauss’ Ariadne auf Naxos 


+ Dich, teure Halle! Grüß ich wieder from Wagner's 
Tannhäuser 


Dramatic soprano 


@ Heavy, brilliant, very loud vocal quality, must carry 
over large orchestra 


Reports of “thicker, stiffer” vocal folds are not well- 
documented 


+ Examples: Birgit Nilsson, Deborah Voigt 


Starke Scheite schichtet mir dort from Wagner's Götter- 
dammerung 


+ Pace, pace, mio Dio from Verdi’s La Forza del Destino 


Lyric mezzo-soprano 


+ Light vocal quality paired with a lower range, roles are 
often young males 


Must be able to sustain long phrases in upper and 
lower ranges 


Examples: Denyce Graves, Frederica von Stade 


+ Mir ist die Ehre wilderfahren (duet) in Strauss’ Der 
Rosenkavalier 


+ Voi che sapete in Mozart’s Don Giovanni 
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Coloratura mezzo-soprano 
¢ Similar vocal quality to lyric mezzo-soprano 


Must be able to sing very fast progressions of notes, 
similar to coloratura soprano 


+ Examples: Cecilia Bartoli, Marilyn Horne 
+ Cruda sorte! from Rossini’s L'Italiana in Algeri 


+ Una voce poco fa from Rossini’s Il Barbiere di Siviglia 


Dramatic mezzo-soprano 

+ Must be able to project over large orchestra in low range 
Most often seen only in operatic performances 
Examples: Dolora Zajick, Grace Bumbry 
Entwiehte Götter from Wagner’s Lohengrin 


Mon couer s’ouvre a ta voix from Saint-Saens’ Samson 
et Dalila 


Tenore leggiero 


+ Light, flexible tenor, especially in comic roles; lightest 
tenor voice 


+ Must be able to access upper range easily, with great 
flexibility 


+ Examples: Jerry Hadley, Cesare Valletti 
+ [| mio tesoro intanto from Mozart’s Don Giovanni 


+ Una furtiva lagrima from Donizetti's L’Elisir d'Amore 


Tenore lyrico, or lyric tenor 


+ Male equivalent of “lyric soprano,” often romantic lead 
in Italian opera 


Must be able to access full range with similar quality 
Examples: Luciano Pavarotti, Enrico Caruso 
Questa o quella from Verdi's Rigoletto 


La fleur que tu m’avais jetée from Bizet’s Carmen 


Heldentenor, or tenore dramatico 


Very strong, “dark,” “heavy” sound, mostly heard in 
Wagnerian opera 


+ Must be able to sustain very loud singing over large 
orchestra 


+ Examples: Ben Heppner, Jon Vickers 
+ Los der Anker! from Wagner's Tristan und Isolde 


+ Ein Schwert verhief mir der Vater from Wagner's Die 
Walktire 


Lyric baritone 


+ Lighter sound than dramatic baritone, range overlaps 
somewhat with tenors 


Must be able to project modal range while still easily 
singing high range 


+ Examples: Dietrich Fischer-Dieskau, Thomas Hampson 
+ Vedro mentre io sospiro from Mozart's Le Nozze di Figaro 


+ Largo al factotum from Rossini’s Il Barbiere di Siviglia 


Heldenbariton, or dramatic baritone 


+ Similar to heldentenor, lower range and “darker” vocal 
quality 


Must be able to sustain projection of low to middle 
voice range very loudly 


+ Examples: Rene Pape, George London 


+ Wozu die Dienste ohne Zahl from Wagner's Tristan und 
Isolde 


+ O du mein holder Abendstern from Wagner's Tannduser 


Bass, or bass-baritone 


Low range, “dark” vocal quality; pitch range overlaps 
with baritone somewhat 


Must be able to project lower range over orchestra 
Examples: Samuel Ramey, Bryn Terfel 


O Isis und Osiris from Mozart's Die Zauberflöte 


¢- ¢ è% o 


Madamina! Il catalogo il questo from Mozart's Don 
Giovanni 


Acoustic and Aerodynamic Evaluation 


Acoustic and aerodynamic data are important tools to docu- 
ment the patient’s vocal quality and perceived vocal effort, 
respectively. Multidimensional voice profiling can document 
the stability of frequency and amplitude, show the preva- 
lence of voice breaks, and document the noise to harmonic 
ratio, among many other data points. If the technology is 
available, it is important to document acoustic data to sup- 
port the perceptual observations made by the patient and 
the speech-language pathologist. 

One of the more useful applications of acoustic data col- 
lection is the voice range profile, or the phonetogram. The 
phonetogram shows the maximum variation of amplitude 
(or dynamic range) as a function of frequency. In the modal 
range, the patient should have a quite wide dynamic range. 
As the patient demonstrates dynamic range near the upper 
and lower range limits, the range should become smaller. 
The result will be a data plot that looks roughly like an 
American football in a normal, healthy singer. There is a dif- 
ference in phonetogram results as a function of vocal train- 
ing (Verdolini and Titze, 1994). Disordered singers will show 
truncated dynamic and/or pitch ranges, depending on their 
presenting complaints. The phonetogram is a valuable tool 
to document patient concerns and to show changes made 
during therapeutic intervention that are easily explained to 
the layperson. Although the phonetogram can certainly be 
created using state-of-the-art acoustic analysis equipment, 
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it can also be produced by simply using a simple pitch pipe 
and a sound-level meter. Therefore, even if the speech- 
language pathologist does not have access to advanced 
acoustic equipment, he or she can still produce this very 
valuable documentation with a minimum of expense. 

Phonation threshold pressures are a form of aerodynamic 
evaluation that document the minimum air pressure required 
to initiate vocal fold vibration. This measure speaks to the 
patient’s perception of vocal effort, with increased phonation 
threshold pressure correlating with increased perceived 
effort. The patient’s phonation threshold pressure value will 
change as a function of hydration (Bless and Hirano, 1993), 
pitch, and possibly degree of vocal training. Decreased phona- 
tion threshold pressure values after behavioral therapy would 
objectively document progress and would support the 
patient’s perception of improved vocal effort. 


Palpation of the Anterior Neck 


Palpation of the anterior neck in the area of the laryngeal 
structures has long been supported as an important part of 
any voice evaluation (Aronson, 1990; Aronson, Peterson Jr, 
and Litin, 1964; Roy, Ford, and Bless, 1996), though it may be 
contraindicated in some specific instances (Angsuwarangsee 
and Morrison, 2002). For singers, this component may be 
modified slightly to include palpation during spoken and 
sung tasks. Whereas moderate pressure may be used when 
the larynx is at rest (to identify signs of persistent hyperfunc- 
tion), minimal pressure may be employed during connected 
speech and simple singing tasks to illuminate any differences 
in overall laryngeal elevation and extrinsic muscular behav- 
ior. The muscles to be palpated include the suprahyoid, 
thyrohyoid, cricothyroid, and pharyngo-laryngeal (inferior 
constrictor) muscle groups. Examination should be con- 
ducted prior to laryngeal endoscopy to avoid residual muscular 
tension in these muscle groups as an artifact of the examina- 
tion process. Patients may report tenderness in these areas, 
particularly if they are experiencing muscle tension dys- 
phonia. In addition to the tactile, this should also be noted as 
possible evidence of excess extrinsic muscular tension 
(Angsuwarangsee and Morrison, 2002). The examiner should 
remember that hypercontraction of the laryngeal muscles 
may not be due to a fundamental flaw in the patient’s vocal 
production technique, but may instead be a symptom that 
appears secondary to internalized anger, anxiety, or depres- 
sion (Aronson, 2004), which restates the aforementioned rec- 
ommendation to employ a wholistic approach to patient care. 


Indirect Laryngoscopy 


Clinical evaluation of the larynx often begins with indirect 
visualization using a light and a laryngeal mirror. This pro- 
vides valuable information regarding gross pathology and 
can also be useful if there are questions regarding color reso- 
lution during endoscopy, but this method is not sufficient as 
the sole source of laryngeal image evaluation. Although visu- 
alization of the larynx is greatly improved when using rigid 
oral endoscopy under a continuous halogen light source, it is 
critical that the endoscopic examination include images 
gathered under a stroboscopic xenon light source as well. 


Stroboscopic data have been shown to provide important 
information about subtle laryngeal structure and vibratory 
dynamics that is easily missed under continuous light source 
and can be significant enough to alter the medical and surgi- 
cal treatment plan (Bless, Hirano, and Feder, 1987; Sataloff, 
Spiegel, and Hawkshaw, 1991; Woo, Colton, and Casper, et al, 
1991). 

The benefits of indirect laryngeal endoscopy under halogen 
and stroboscopic xenon light sources have been well docu- 
mented (Hertegard, 2005) and have since become the stan- 
dard of care for most comprehensive voice evaluations. It is, 
however, beneficial to consider the unique needs of this spe- 
cialized patient population when choosing laryngeal visuali- 
zation procedures. Rigid oral endoscopy (including 
endoscopy using visualization angles of 70 or 90 degrees) 
provides excellent image clarity, allowing for detailed exami- 
nation of minute laryngeal structure and vocal fold vibratory 
dynamics. However, this modality only allows for sustained 
vowel production (usually /i/). To see what the larynx and 
surrounding structures are doing during functional voice use 
tasks (speaking and singing), flexible fiberoptic nasal 
endoscopy must be employed. Standard flexible endoscopes 
are usually able to provide adequate visualization of the lar- 
ynx under halogen light, but using a stroboscopic light source 
to visualize vibratory dynamics is, at the time of this writing, 
often quite challenging during nasal endoscopy. Furthermore, 
the image quality during nasal endoscopy is not as precise as 
that of rigid oral endoscopy, making it rather necessary to 
employ both modalities to gain a complete impression of 
functional laryngeal behaviors. It must be noted that distal- 
chip endoscope technology produces an excellent picture 
resolution during nasal endoscopy under halogen light 
source, but connecting this type of nasal endoscope with 
commonly used stroboscopic equipment and software, at the 
time of this writing, is often challenging. High-speed oral 
endoscopy is an extremely valuable tool for identifying subtle 
anomalies in vocal fold vibratory cycles, particularly in 
patients who present with irregular or aperiodic vibratory 
cycles that would confound the standard stroboscopic tech- 
nology. Continued research and further technologic advances 
will certainly make high-speed laryngoscopy a more widely 
available diagnostic tool (Snelgrove, 2006; Yan, Ahmad, Kun- 
duk, et al, 2005). 

Whether the voice evaluation consists of standard 
rigid/flexible endoscopy or includes more technologically 
advanced imaging techniques, the role of the speech-lan- 
guage pathologist is significant. It is critical to use these 
images to identify technical or behavioral anomalies that may 
be influencing the patient’s overall voice outcome. The pri- 
mary goal is to compare the patient’s perceptual vocal quality 
with the images gathered during laryngoscopy. If the images 
suggest a better laryngeal function than the perceptual qual- 
ity demonstrates, then a behavioral etiology is indicated. At 
this point, a treatment plan should be devised that is consis- 
tent with the patient’s goals and performance schedule. If the 
patient is able to attend multiple consecutive treatment ses- 
sions, the goals for treatment should be written in terms of 
the desired change in vibratory dynamics, as objectively 
assessed via laryngoscopy. Examples of such goals are as 
follows: 
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The patient will achieve open/closed quotient of 70/30 dur- 
ing phonation using Resonant Voice Therapy as assessed 
via videostroboscopy. 


or 


The patient will achieve complete vocal fold closure using 
Resonant Voice Therapy during sustained /i/ as assessed 
via videostroboscopy. 


This method of writing goals for voice therapy emphasizes 
the effect of behavioral voice treatment on vocal fold physi- 
ology by identifying the targeted physiologic goals needed to 
improve perceptual vocal quality. The distinction between 
the role of the speech-language pathologist and the physi- 
cian in interpreting and implementing the objective 
videostroboscopic data is also highlighted, providing clear 
support for the active involvement of the speech-language 
pathologist in the endoscopic evaluation process. The ASHA 
and the American Academy of Otolaryngology—-Head and 
Neck Surgery (AAO-HNS) have issued a joint position state- 
ment clarifying the roles of the speech-language pathologist 
and the otolaryngologist in the performance and interpreta- 
tion of videostroboscopy (American Speech-Language-Hear- 
ing Association, 1998). This is available on the ASHA Web site 
at www.asha.org. 


¢ Treatment 


Considerations for Treatment 


Professional singers who are touring frequently cannot 
attend multiple treatment sessions. In fact, they often have 
difficulty returning to the clinic at which they were evalu- 
ated, simply because of their traveling schedule. Therefore, 
it is often necessary to provide the patient with behavioral 
exercises via a self-directed home practice regimen at the 
time of the evaluation appointment. For many singers, this 
involves detailed discussions regarding behavioral and 
environmental precautions to help control LPR, recommen- 
dations for periperformance practices, and establishing 
consistent warm-up and cool-down exercise regimens that 
can be reasonably implemented without multiple sessions 
of therapy. 

Given patients’ anxiety regarding their dysphonia, it is 
likely that they will have difficulty recalling some of the 
information presented to them during the evaluation and 
therapy processes. Inaccurate recall of medical information 
may lead to noncompliance with therapy regimens (Behrman 
and Sulica, 2003). Therefore, it is highly advisable to provide 
patients with written instructions for all recommendations 
and behavioral exercises, as well as copies of diagnostic data 
and reports if available. Because many professional singers 
use the Internet to maintain communication with family and 
friends during their concert tours, they will often travel with 
a laptop or notebook computer. Most patients will appreciate 
having a copy of their endoscopic video file on CD for their 
personal medical records, particularly if their follow-up 
examination will be completed at another clinic. 


Indications for Surgery 


Most surgeons prefer to exhaust all medical and behavioral 
options before they will consider performing phonosurgery 
on a patient who is a professional singer, though this largely 
depends on the diagnosis. Anxiety among professional 
singers is a natural reaction, and often the speech-language 
pathologist is intimately involved in providing information 
regarding expected outcomes and perioperative behavioral 
treatment. Because a surgical recommendation usually pro- 
vides a pause in the patient’s touring schedule, preoperative 
therapy is indicated to train behavioral exercises that will be 
used postoperatively, as well as to reinforce the surgeon’s 
recommendations regarding postoperative voice rest and 
realistic expectations. 

Because postoperative recommendations for total vocal rest 
vary somewhat widely between surgeons (Kleiner, 1999), it is 
important that the communication between surgeon and 
speech-language pathologist be sufficient to ensure that con- 
sistent information is provided to the patient. The preoperative 
therapy session is also an opportune time to establish plans for 
postoperative compensatory strategies, including changes to 
the patient’s outgoing message on his or her answering 
machine or voice mail, procuring materials to support primary 
written communication, and identifying situations, behaviors, 
and environments to be avoided during the period of total 
vocal rest prescribed by the surgeon. It is critical that the rec- 
ommendations made during this session be provided to the 
patient in written format for future reference. Postoperative 
vocal rest may also mean “modified” vocal rest, a period during 
which the patient is allowed to phonate for conversational 
speech but is not recommended to resume performance. The 
period of vocal rest, whether total or modified, should be made 
on a case-by-case basis, the decision-making process being 
shaped by the severity of pathology and surgical intervention 
and by the patient’s expected vocal load when resuming per- 
formance. Using the analogy of Olympic-caliber athletes being 
recommended for carefully modified training rather than bed 
rest after some surgeries, professional singers should not be 
recommended for a prolonged period of vocal rest simply as a 
matter of course. They will most often benefit from adjust- 
ments to their training to ensure adequate healing of the surgi- 
cal site while still maintaining the high level of muscle tone 
and neuromuscular control that the have worked for many 
years to develop. The surgeon and the speech-language 
pathologist, through effective communication and collabora- 
tion, can create a plan for return to vocal function that strikes a 
balance between avoiding the deleterious effects of prolonged 
vocal inaction with the need to maintain a laryngeal environ- 
ment that will effectively facilitate adequate postoperative 
wound healing. Research is continuing in the area of vocal fold 
wound healing at the time of this writing, but it is generally 
accepted that due to the detrimental effects of scarring on 
vocal fold viscoelasticity and vibratory dynamics, great care 
must be taken to minimize the risk of vocal fold scar in the 
population of professional singers. The current literature does 
suggest more rapid postoperative reestablishment of the base- 
ment membrane zone in an environment of total vocal rest in 
a canine model (Cho, Kim, Lee, et al, 2000). Branski and col- 
leagues showed in a rabbit model that the wound-healing 


282 Clinical Voice Disorders 





process was characterized by neo-lamina propria deposition 
at 3 days postoperatively, with complete epithelial coverage 
achieved at 5 days, but by the 21-day mark the layered struc- 
ture of the lamina propria had not yet been regained (Branski, 
Rosen, Verdolini, et al, 2005). Coupled with the results that 
Behrman and Sulica found regarding variable postoperative 
voice rest instructions (Kleiner, 1999), these data show that the 
optimal recommendation for resuming preoperative vocal 
load after phonosurgery remains elusive. Until further 
research provides more definitive guidelines, return to vocal 
function should be determined after complete evaluation of 
the extent of the surgical intervention and the patient’s 
expected functional requirements. 

Postoperative therapy is also recommended (if possible) to 
ensure accurate performance of assigned behavioral tech- 
niques. Recommended therapy techniques include (but are not 
limited to) Resonant Voice Therapy (Verdolini-Marston, Burke, 
Lessac, et al, 1995) and Vocal Function Exercises (Sabol, Lee, and 
Stemple, 1995; Stemple, 2005; Stemple, Lee, D’Amico, et al, 
1994). It is not unusual for professional singers to be unable to 
participate in behavioral voice treatment for multiple sessions, 
and the speech-language pathologist is therefore challenged to 
establish an effective independent practice program in a single 
session. Although the program should certainly be tailored to 
the needs of the individual patient, the somewhat structured 
nature of Resonant Voice Therapy and Vocal Function Exercises 
can be helpful for instances such as this. And although the liter- 
ature suggests some degree of individual variability (Blaylock, 
1999; Milbrath and Solomon, 2003; Motel, Fisher, and Leydon, 
2003; Welham and Maclagan, 2004; Vintturi, Alku, Lauri, et al, 
2001), the patient will often benefit from the establishment of 
warm-up and cool-down exercises to be performed at the 
beginning and at the end of the day, respectively, particularly 
for the singer who has not had previous formal singing training. 
These exercises may often include pitch glide tasks performed 
using lip/tongue trills (Titze, 1996), arpeggios, humming, and 
messa di voce exercises (Titze, Long, Shirley, et al, 1999). 


Suggestions for Continuing Education 


For the speech-language pathologist who is not a singer him- 
or herself, it may be challenging to obtain adequate continu- 
ing education to develop a sense of confidence with this spe- 
cialized patient population. For some other subspecialties, 
there are brief and efficient conferences or certification pro- 
grams available to provide this education. Working with pro- 
fessional singers is, by contrast, a lifelong educational 
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Chapter 13 


Extended Case Studies in Psychogenic Voice Disorders 


Case Study 13.1 Psychogenic Loud Voice 


History 


Ms. R., a 35-year-old single woman, called for an appoint- 
ment to discuss what she describes as a problem of an 
excessively loud voice. Ms R., who works for a newspaper, 
was recently reprimanded for speaking too loudly. This 
patient describes her inability to keep the intensity of her 
speech down to appropriate levels as a lifelong problem. 
She has decided that the time has come when she should 
do something about this tendency or else review its social 
and vocational repercussions. 


Examination 


The patient is a pleasant, obese woman whose voice is 
somewhat low in pitch and potentially quite loud. How- 
ever, during the examination, she demonstrated ample 
ability to maintain a normal loudness level of her voice 
appropriate to the clinical situation. 

Psychosocial examination reveals that the patient is 
one of six children who had to use her voice and speech 
as a means of defense while she was growing up. During 
high school, she was repeatedly reminded she was too 
loud. She considers that all her life she has been verbally 
aggressive and defensive. She volunteers that where she 
is concerned, loudness is equated with anger. Ms. R. has 
had past psychologic problems of self-acceptance and 
has had both psychologic and, currently, psychiatric help 
to learn to accept herself to a greater extent than she 
has, and not to become angry with herself or to take the 
blame for all interactions that go bad. During much of 
this discussion, she cried quietly. Her attitude lifelong 
has been that “the world is not fair,” but she volunteers 
that she is the one who will have to change. She is aware 
that she is considerably overweight and has recently 
become concerned about her personal appearance. She 
intends to diet and learn to dress in more sophisticated 
fashion. 
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Impression 


Ms. R. has a problem with poor control over loudness of her 
voice in the direction of excessive loudness, which appears 
secondary to personality and adaptation to her environ- 
ment, traits that she has possessed throughout much of 
her life. The examining clinician sees her voice disorder as 
just the surface of a deeper problem of hostile reactions to 
others, but in a woman who appears to have considerable 
insight and motivation to change. She believes that the 
psychotherapy, infrequent as it is, that she has been receiv- 
ing from her psychiatrist has helped her gain increased self- 
acceptance and motivation to change. 


Therapy 


While it appears that Ms. R.’s voice disorder is primarily 
psychogenic and that a change in the aggressive use of 
her voice ought to parallel improvement in her self-image 
and her relationships with others, the need for sympto- 
matic voice therapy to learn control of her loudness level 
is also evident. 

Two years after symptomatic voice therapy, Ms. R., now 
37, returns for a brief visit and review of the problem that 
first brought her to the clinic. Since the initial visit, she has 
been seeing her psychiatrist, approximately once a 
month, for depression and to work out some of her inter- 
personal problems at home. She claims she was able to 
control her loudness problem until 5 weeks ago when she 
was reprimanded by her supervisor for speaking “too 
loudly again” and using the phone too much. The patient 
says, “That got to me.” It is important to note that the 
patient had been prescribed Valium (diazepam, Hoff- 
mann La Roche, Nutley NJ), which may have resulted in 
some disinhibition of behavior. Valium was subsequently 
terminated, and the patient was on Limbitrol (chlor- 
diazepoxide, amitriptyline, ICN Pharmaceuticals, Inc., 
Costa Mesa, CA) at the time of her subsequent visit. 

During this evaluation, the patient’s speech is very well 
modulated, with no signs of loss of loudness control. The 


clinician and patient discussed the probable atypical situ- 
ational nature of her loss of loudness control and that no 
symptomatic voice therapy was indicated. They also dis- 
cussed the importance of continued psychotherapy with 
the objective of improving self-understanding. The clini- 
cian suggested that she do some reading in the area of 
interpersonal relations and self-help recommendations 
that have been reported in the literature. An additional 
symptom during the current examination was a change in 
the patient’s articulation. The patient’s articulation was 
+1 ataxic dysarthric, very possibly related to her current 
medication but also could be a sign of neurologic disease. 
She did not complain of any gait or upper-extremity inco- 
ordination. Thus, reassurance appeared to be the method 
of choice in this situation. The patient was encouraged to 
remain in touch. 

Some months later, Ms. R. called to say that she had 
completed her psychotherapy, that it had been quite 
helpful as she had learned to regard herself in a better 
light, that she does not have to be angry, that she learned 
where the anger was coming from, and that her generally 
reduced hostilities reflected itself in much less difficulty 
with excessive vocal loudness. She felt she was doing 
quite well both psychologically, with regard to her speech, 
and in her ability to control her loudness during conversa- 
tions with others. People have remarked that she is less in 
evidence: “| don’t know where you are anymore,” one 
said. She also reported that she was successful in control- 
ling her loudness during a recent “argument” with a 
friend. Some days it is an effort to remain aware of the 
need for control. She continues with her self-improve- 
ment plans; has been for “color analysis” wearing differ- 
ent makeup, intends to modify her wardrobe, and to 
continue working on her obesity. She still sees her psychi- 
atrist, reporting fewer and less intense bouts with depres- 
sion. She takes lithium. An important clinical insight: her 
loudness has been the main reason for her previous 
job/interpersonal failures. It resulted in “punishment” for 
her, as she puts it, “a self-fulfilling prophecy” that, per- 
haps, she needed to fail and that her speech was a direct 
route to alienation of others. She said she no longer 
needed to do this. At this point, it appeared that she had a 
good hold on her loudness control and that voice therapy 
was close to termination. Consequently, she was 
requested to call again in about 2 weeks for a check on 
progress and then be terminated from treatment. 


Case Study 13.2 Psychogenic Aphonia 


History 


Miss K. is a 33-year-old, single, remedial reading teacher 
who had sudden loss of voice associated with laryngitis 1 
month prior to examination. Since then, she has been 
essentially whispering. She has had two to three previous 
voice losses, which have lasted only a day and which 
remitted spontaneously. This time, however, it has lasted 
a month. 
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Examination 


Otolaryngologic examination disclosed normal vocal fold 
structure and ability to adduct and abduct the vocal folds. 
During the speech pathologist’s voice examination, the 
patient converses in a strained whisper. She demonstrates 
+2 musculoskeletal pain in the thyrohyoid space on the 
left. Her coup de glotte was normal again indicating nor- 
mal vocal fold adduction. 

With minimum manual mobilization of the laryngeal- 
hyoid area, the patient’s voice came back within minutes. 

The psychosocial history indicates that in all probabil- 
ity, Miss K. was suffering from an accumulation of exter- 
nal pressures or stresses on or about the time she lost her 
voice a month prior to the evaluation: (1) her mother had 
entered the hospital, and the patient had become quite 
worried about her (it turned out that her mother did not 
have any serious illness); (2) the patient was pressured by 
an accumulation of paperwork that usually occurs at the 
end of the school year, but this year it was especially bad 
because of failure of various test services to return papers 
to her that she needed for processing; (3) she had been 
experiencing a flare-up of her multiple schlerosis (MS) 
consisting of paresthesias of the hands and feet, dizzi- 
ness, a feeling of thick-headedness, and increased diffi- 
culty walking; (4) she has had an ongoing problem with 
her colleagues who teach regular classes and who resent 
her small caseload and the interference that they per- 
ceive she is creating by taking children out of the class- 
room for special help. 


Impression 
Psychogenic (probably conversion) aphonia. 


Recommendation 


The patient’s voice had returned to normal early during 
this examination. Once voice was achieved, the remain- 
der of the session focused on the relationship between 
voice losses and external stresses so as to provide some- 
what greater insight for the patient, so that if she per- 
ceives that pressures are accumulating, she might con- 
sider avoiding taking on excessive responsibilities. 


Case Study 13.3 Psychogenic Hoarseness 


History 


A 24-year-old business major was referred by an ear-nose- 
throat doctor for evaluation of episodic hoarseness and 
sore throat that was of 6 to 7 years’ duration, from which 
time the patient was a college freshman. She had alter- 
nated going to school with working, accounting for the 
extended duration of her undergraduate years. During the 
prior few months, voice and throat discomfort worsened. 

ENT examination revealed essentially normal-appear- 
ing vocal folds except for some erythema in the region of 
the vocal processes. 
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Examination 


This patient’s voice was perceptually normal. However, 
she felt throat discomfort and described a feeling of a 
“baseball in my throat sometimes” (globus). 

The psychosocial history was highly positive. Over the 
past 15 years, she had accumulated a tremendous 
amount of unexpressed anger at her father and especially 
her stepmother and her stepsisters for the way in which 
she was treated—made to feel unwanted and creating a 
“her kids—our kids” conflict. The patient spent a year in 
another city away from the family, and it seems that since 
returning more of her angry feelings have come to the 
surface, and she has been communicating her anger and 
resentment in a small way toward members of her family. 
This, however, is very difficult for her because, typically, 
her entire mode of dealing with others is to suppress 
anger, which she regards as “scary” and which she 
equates with “violence and revenge.” She freely admit- 
ted, “I don’t know what to do with anger.” 

The moment she was asked about her personal life, she 
began crying and did so on and off for the remainder of 
the examination. She described her stepmother as “kind 
of ridiculous as a mother, an alcoholic.” The patient was 
tired of being “treated like dirt.” She described years of 
suppressed pain, “I have a lot of anger but you can’t do 
that.” She felt as if she has been “stomped on for years and 
years.” As she cried, she said, “I think | must be upset at 
myself for taking this for so many years . . . now | feel like | 
should say stuff, but it’s pretty scary.” She said she felt 
better the other day when she was crying and expressing 
her anger and hurt toward her stepmother but that it hap- 
pened very infrequently because of her censorship of her 
feelings. “No one has ever seen me angry or raise my 
voice.” 

Possibly contributory has been the fact that she leads a 
very active life, working hard as a student, holding down a 
job as a secretary, and admitting that she pushes herself 
“really hard.” In the previous year, she had mononucleosis 
and did not think she ever recovered completely from that 
and reported frequently feeling tired and not getting 
enough sleep. 

She saw a psychologic counselor briefly. 


Impression 


The combination of the erythematous posterior glottis 
and case history suggest that this patient’s episodic dys- 
phonia and throat discomfort may be due to esophageal 
reflux and musculoskeletal tension. The musculoskeletal 
tension appears secondary to chronic emotional stress 
and unexpressed anger, resentment, and hostility toward 
members of her family, occurring in a young woman 
whose approach to interpersonal relationships is one of 
not expressing emotions of anger, particularly because of 
her perception of its social unacceptability. 


Recommendation 


The voice clinician devoted a considerable portion of the 
initial session toward explaining to the patient the rela- 
tionship between abnormal voice, laryngeal muscle ten- 





sion, and suppressed emotions, particularly anger. The 
patient, rather typically, found it difficult, but not impossi- 
ble, to accept this explanation because of feeling guilty 
and that to have a physical problem because of emotional 
difficulties seemed like a sign of “weakness.” At the same 
time, she seemed to recognize the connection, and she 
was strongly advised to follow an antireflux regimen and 
to consult with a clinical psychologist or psychiatrist to get 
out into the open much of the pent-up emotions that she 
has never fully expressed to anyone. 

This case is yet another classic example illustrating that 
there often is not a single etiologic basis for laryngeal 
complaints. Determining whether the emotional state 
was the basis for the reflux, or if the reflux set up the tissue 
irritation that when combined with musculoskeletal ten- 
sion caused the globus, or if the conditions were initiated 
with the mononucleosis and maintained by the muscu- 
loskeletal tension secondary to the emotional state, can- 
not be determined. What can be determined is that there 
are co-occurring factors that need to be treated concomi- 
tantly. 


Case Study 13.4 Psychogenic Adductor Spasmodic 
Dysphonia 


This next case history is presented as a transcript of the 
initial evaluation of a 31-year-old married woman who 
exhibited the perceptual characteristics of adductor spas- 
modic dysphonia. She had a strained, strangled voice with 
frequent voice stoppages. The case is illustrative of how 
clinicians can ask questions to help patients discover facts 
about the onset of their voice problems and to uncover 
the psychodynamics of the individual. It emphasizes the 
importance of recognizing how difficult it is for patients to 
accept having a problem related to stress and recommen- 
dations to receive psychotherapy. It also suggests clini- 
cians should consider the possibility of muscle tension 
dysphonia when hearing characteristics of adductor spas- 
modic dysphonia. 

Clinician (C): When did the strained voice first start? 

Patient (P): | really became strained | guess 6 to 7 
months ago. When | say strained, | mean to the point that 
words would not come out whenever | tried to express 
them. 

C: But when was the very first change in your voice? 

P: Actually a year and a half to 2 years ago. The reason it 
is so amazing is because it wasn’t anything that hit me; it 
was very subtle. 

C: What specifically did you notice a few years ago? 

P: Well, that | would wake up and have a gruffness of 
voice and not have a sore throat or cold. 

C: Now, this was 1 to 2 years ago? 

P: Yes. About a year and a half, and it would just be 
maybe a day or two and people would ask me if | had a 
cold and | wouldn’t and then it would and it would just 
fade away so | never really gave it much consideration. 

C: And it was off and on, episodic for the last year, year 
and a half? 


P: Right. 

C: And then when did the voice change into a different 
pattern? 

P: | would have to say probably November of last year. 
That’s when | really started noticing that it was sporadic. 

C: About 6 months ago? 

P: Yes. 

C: And what happened then that was different from 
before? 

P: Well it just . . . in conversing, words just would not 
come out and | think, besides, it was a huge amount of 
tension or knots in my stomach. 

C: Did the voice sound or feel more strained? 

P: Yes, strained is a good word. | felt like it was a strain to 
talk. 

C: And did it remain that way throughout the entire 6 
months? 

P: Well, to me it has gotten worse. 

C: But when it started out 6 months ago, how often 
would you have it? 

P: | could get it like 2 or 3 days of talking normal and 
then maybe a week of not being able to, or just barely 
being able to talk. 

C: And when did it become constant? 

P: | would say constant within the past 3 months, con- 
stant where it’s almost every day. And as | said, there are 
still points where | have 2 or 3 days where it is perfectly 
clear. 

C: Do you associate those days of clarity with anything 
else? 

P: Not really. It’s usually when I’m away from the house. 

C: And where would you be? 

P: Well, out riding my horse or out visiting with people 
on the farm or something like that. 

C: Your voice would be better in general when you were 
away from the house. 

P: Right. And it is always better in the morning. When | 
first wake up, | am perfectly clear. And when I have a few 
drinks of alcohol, it seems to be much clearer. 

C: You’ve been married how long now? 

P: Since August; 9 months | guess. 

C: Were you having emotional problems 1% to 2 years 
ago? 

P: No. 

C: But you have been over the past 6 months? 

P: Yes. 

C: Please describe that. 

P: | think I’ve been unhappy in a situation that | haven’t 
done anything about, whereas normally | would. If I’m 
unhappy in a situation | change it and | just haven’t done 
anything about it. The marriage. 

C: When did you become concerned enough about the 
voice to do something about it, to investigate the voice as 
a problem? 

P: The first time about a month ago. 

C: And what did you decide to do then? 

P: | went to talk to a specialist; an ear, nose, throat man. 

C: And what did that doctor tell you? 

P: The only thing that he told me is that he didn’t see 
any damage to the vocal folds. He said that he did see, or 
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he thought there was a small cyst at the end of my vocal 
folds and that he recommended | speak to a speech 
pathologist. He wrote to the speech pathologist that | had 
spastic dysphonia. 

C: What did the speech pathologist do? 

P: Well, she told me she didn’t think it was spastic dys- 
phonia and she set it up that | would have sessions twice a 
week, 30 minutes at a time that we never did do, because 
this doctor had talked to a laryngologist who specializes in 
spastic dysphonia. 

C: So you never had any voice therapy? 

P: Right. Never have. 

C: Why not? 

P: Well, . . . after we talked to this doctor ~30 minutes 
on the telephone he was ready to do surgery that to me 
was a turnoff because it just didn’t make much sense to 
have an operation when it hasn’t even been investigated. 
And, a couple of times when | had the appointment set up 
with the speech pathologist, my husband finally said, 
“Well, there is no need to even go talk to her.” He said 
“We’re going to come up here.” 

C: You mean on the basis of talking to you on the phone 
he asked you to come out and have surgery? 

P: He was ready to. He said that we would of course do 
the investigation and think out everything but he said that 
the only, or in his opinion he was convinced, the only solu- 
tion was to clip the nerve in the vocal fold and he 
explained it in detail to my husband and myself. But... | 
haven’t had any speech therapy with anyone yet. 

C: When you saw the psychiatrist this morning, your 
voice was worse than it is now? 

P: Yes. 

C: How much worse? 

P: | think it was more difficult to get words out. 

C: Much better now or just mildly? 

P: Just mildly better. 

C: Did it help you to discuss your situation with the psy- 
chiatrist? 

P: Immensely. And afterwards | discussed it with my 
husband, which made me feel much better. 

C: You felt much better? 

P: Yes. 

C: What was your husband’s reaction? 

P: Well, he is very... he doesn’t want to accept the fact 
that we don’t have a marriage and it’s very difficult for 
him. He doesn’t understand a lot of things about it which 
| understand. 

C: What did you tell your husband after you talked with 
the psychiatrist? 

P: Well, | just explained to him what we discussed about 
marriage and certain reasons why | didn’t like marriage. 
And that | did feel my inward tensions. Because I’ve never 
been a nervous person or shaky, you know, about anything, 
and just since we’ve been married I’ve been fighting the 
divorce or separation. For my husband’s sake, | wanted to 
stay married. And, well | just told him exactly what we dis- 
cussed and that the psychiatrist had recommended psy- 
chotherapy and he also asked that we consider separation. 
I’m the one that has been worrying about the divorce, you 
know. Just go ahead and end it and get it over with. 
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C: The one who has what? 

P: Wanted to just end it completely, the relationship 
and as far as a marriage goes. But, the few options that he 
mentioned | thought sounded pretty good. 

C: And what did you husband think about the possibili- 
ties? 

P: Well, he would like to discuss it with the psychiatrist 
and have more insight. | did tell him exactly what was dis- 
cussed and I think he might want to talk to you, too, which 
| have no objections to. 

C: But is it your interpretation that the psychotherapy is 
supposed to be some way of forestalling the eventual sep- 
aration and divorce or is it something you have decided 
on? 

P: Well truthfully, psychotherapy | think is great and | 
have no objections to it. | don’t think that that is needed in 
my case. | think what | need is probably somebody that 
knows how to work with someone that has a voice prob- 
lem. Mentally, I’m very fit. My only discontentment is I’m 
in a marriage situation that | love the man but | don’t want 
to be married to him. 

C: Well, that is a rather interesting split there. 

P: Well, yes itis. It’s very strange. 

C: How do you account for that? 

P: | guess | account for it by the fact | don’t like the 
closeness. I’ve lived 29 years very happily single. | love 
being single. My favorite things are to ride the horse out in 
the woods for days or read books. Of course, | love being 
with people also. | was an executive so | had to be with 
people but, marriage isn’t anything like | thought it would 
be. 

C: What did you think it was going to be like, and how is 
it? 

P: Well, | thought it would be great deal of sharing emo- 
tionally and materialistically, all the things that you share, 
but I didn’t think it would be sharing everything, meaning 
the husband; he doesn’t want to do anything unless | am 
with him. Whether it’s golf or whether it’s a trip. To me it 
has been very draining because I’ve never in my life been, 
even when | lived with a big family, and we’ve had all that 
closeness, I’ve never had that much closeness in that way; 
to me it is like 24 hours of... 

C: Do you feel stifled by it? 

P: Yes, and I’ve told him that and explained it to him and 
he doesn’t do it purposely. | think it’s the first time he’s 
ever been in love and he’s gotten carried away with it. And 
the difference being he’s a person that needs me and | 
don’t need him. | don’t need him in the way, | mean to ful- 
fill my life and that is a big difference in our relationship. 

C: Throughout your life, have you been the kind of per- 
son whois able to express her emotional feelings freely or 
have you been the kind who has kept them to herself? 

P: | would have to say 50-50. For the past 15 years | 
would say I’ve been very outwardly spoken. 

C: Did you have trouble confronting your husband with 
your feelings about the marriage? 

P: Not at all. In fact, before we got married, | was really 
fighting getting married and | expressed that to the psy- 
chiatrist and you know he said, “Well, why did you?” And | 
said | wished | knew because it just happened real sud- 





denly and my husband is very persuasive person and, I’ve 
always really cared for him, too. 

C: But you went ahead with it anyway. Did you express 
your misgivings to your husband before you were mar- 
ried? 

P: Very much. | told him | was not a touchy-feely person. 
| mean I’m loving and | have a lot of love to give, and | can 
accept love but he is starved for love in the way that he 
really needs constant patting on the head and a lot of 
attention and a lot of what | can’t, | guess, give, you know, 
if | were in the right frame of mind. But there are things 
that | can’t give him that he needs. And | don’t know if 
there is any reason for him to lack for that just because of 
me. 

C: May | ask whether you think your voice is better now 
than when we started? 

P: Much better. 

C: Do you and your husband have much in common? 

P: Oh gosh! We have everything in common. We’re 
both very adventuresome in the way that we love to 
travel, we love to play golf, we both ski. 

C: Do you feel right now as if you have a load off your 
chest, your mind about this whole thing? 

P: Yes. 

C: Because there has been some movement here and 
you've gotten closer... 

P: Well, | did because | kept rejecting talking to anybody 
professional about it. | kept thinking, well, whatever it is 
will go away tomorrow, because tomorrow would be fine. 
And then, when | finally did get to the point where it was 
so frustrating not being able to converse and express or 
even go to a counter and order a bottle of perfume with- 
out somebody saying “What are you saying?” When | 
finally did go to somebody, all it did was confirm my gut 
feelings, my inner feelings, which were that | felt like it was 
something to do with keeping things in. In other words, 
I’ve always been a happy person, and I’m always jovial and 
never depressed and just lately I’ve been in a state of 
depression. 

C: How long? 

P: Well, I’d say about 3 or 4 months. And that’s when my 
voice told me that what I’m doing is that I’m fighting 
myself. I’m not letting myself do what | naturally do, 
which is always to do whatever makes me happy. 

C: Do you think the voice has more or less paralleled the 
depression? 

P: | feel like it has. 

C: The more depression the worse the voice? 

P: Yes, because | feel like it, in other words, as | said ear- 
lier I’ve never been a nervous person, well lately I’m not 
nervous but I’m very, very tense. | lay down to go to sleep 
and try to sleep and then just start thinking. To get to 
sleep | have to use progressive relaxation techniques. lIl 
slowly go from toe to head and relax each muscle section 
of my body. 

C: But, how has coming up here and talking to the psy- 
chiatrist or to me helped you? 

P: Well... itisn’t a physical thing or | mean I don’t have 
a tumor or | don’t have something physical . . . | feel more 
comfortable about that. | also feel more comfortable 


because | hadn’t talked to anybody at length about this, 
and after talking to the psychiatrist he suggested psy- 
chotherapy and mentioned the fact that we might want 
to consider a 6-month separation, which I’m open- 
minded to. 

C: Well, what do you want to do? 

P: | think it is probably worth a try to first be away from 
each other for 6 months and for me to seek whatever ther- 
apy | need during that 6 months. 

C: If you’re so convinced that marriage is not for you, 
what value would separation have? 

P: Well, that’s a good point, because |’m almost saying a 
contradiction. 

C:... prolonging something that you think... 

P: Well, that was something | talked about when we 
went back to the room after | talked to the psychiatrist. | 
told him that as far as | was concerned, a divorce is fine 
with me, but for my husband’s sake if he wanted to make 
ita 6-month separation possibly | may find out I’m making 
a bad mistake and want to come back. 

C: In other words, there might be some value in separa- 
tion. 

P: Possibly so. 

C: You may be 99% sure but you may not be 100% sure. 

P: Exactly. That’s a very good way of putting it. Like | said 
to the psychiatrist, | don’t want to make it sound like I’m 
sloughing the voice thing off on the marriage but | know it 
has gotten increasingly worse and | figured it had some- 
thing to do with it. 

C: The last 6 months that you’ve had the voice trouble 
have you been away for any length of time like a couple 
days or a week? 

P: Yes, | went to another city 3 weeks one time. 

C: How was your voice then? 

P: It was real good. 

C: Was it normal or just good? 

P: lwould say there was only 1 day that it wasn’t normal. 

C: So the 3 weeks you were away, the voice was essen- 
tially normal? 

P: Yes. 

C: And when you got back home it was bad again. 

P: Yes. 

C: And how long ago was that? 

P: Seems like it was about 3 months ago that | went 
away. 

C: Well, it seems to me that the voice trouble fluctuates 
so closely with how you are feeling, whether you are 
relaxed or whether you are anxious, depressed, or any- 
thing like that. | don’t think voice therapy twice a week is 
the answer at all. | think the answer is reducing some of 
that tension in the area of your voice box, and getting 
some psychological help with the major problem that has 
caused the voice trouble. In other words, you don’t have a 
habit pattern established here. | believe that voice will 
return to normal the moment you get out from under 
stress. And, in the meantime I'll show you today how you 
might reduce the neck tension. Does that sound right? 

P: Most of the time, yes. 

C: Right now for example, you are more relaxed and you 
have put the cards on the table. Your voice is nearly nor- 
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mal. When you were away for 3 weeks the voice was 
almost normal because you were away from the situation 
that was causing you to be tense. 

P: The only thing about that is before | was even married 
| was having problems with the gruffness for no reason, 
and I don’t know if this stems from that or... 

C: | don’t really know. One of our problems is that the 
farther back you go the harder it is to remember how you 
were feeling. Now, presumably we're talking about an 
extension of that earlier problem. It is entirely possible 
that this is a new disorder and that the other one was 
related to something else. Did the other one, a couple 
years ago, sound this way or was it a different sound? 

P: No, it was just a bit of a gruffness in the voice. 

C: It is conceivable that these are not related and this is 
a brand new disorder that started during the time of your 
marriage. But, it is conceivable also that the gruffness that 
occurred a couple years ago was due to some tension, 
too. Do you remember having been depressed at that 
time? 

P: Yes, | did go through kind of a move. | had a beautiful 
condo and | had to sell it and go to work. 


Postscript 


Manual circumferential therapy was demonstrated to the 
patient with positive results. She left the session with a 
normal-sounding voice and claimed the voice was free of 
tension and that she would follow up with psychotherapy. 
Three weeks later, the patient’s husband called to say that 
his wife was dissatisfied with our diagnosis, that her spastic 
dysphonia was worse, and that she thought the cause was 
organic, not psychologic. She planned to find another ENT 
specialist for further investigation. Two years later, she 
returned to the clinic for further consultation. She had 
found another specialist who injected Botox® (botulinium 
toxin, Allergan, Irvine, CA) with which she had intermittent 
success. She terminated that treatment, began psy- 
chotherapy, divorced her husband, and developed a near- 
normal voice. She returned for voice therapy to be 
reminded how to reduce and/or control laryngeal tension. 


Case Study 13.5 Psychogenic Mutational Falsetto 
(Puberphonia) 


History 


Dr. L. was a single, 29-year-old resident in dermatology 
who was referred by an otolaryngologist for the purpose of 
evaluation of along-standing voice disorder. Dr. L. believed 
that he had had a hoarse, high-pitched voice for as long as 
he could remember. He recently had become concerned 
over his voice when another resident remarked to him that 
he was “known by his unusual voice.” When he was an 
intern, he was told bya more senior physician that his voice 
had a “bad effect” on patients. Were it not for these 
remarks made within the past few years, and particularly 
more recently, Dr. L. would not have sought help for his 
voice. When questioned further about the voice, the 
patient indicated that he considered it more “hoarse” than 
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high-pitched, and he noted difficulties being understood 
when having to speak in a noisy environment, such as at 
parties. His family had never mentioned his voice to him. 


Examination 


Dr. L. was a very pleasant young man noticeably small and 
short in stature who spoke in a high-pitched, falsetto 
voice. Phonation under conditions of maximum inspira- 
tion and sharp glottal attack on phonation produced a 
normally low-pitched voice, indicating normal laryngeal- 
pharyngeal anatomy for a completely masculine voice 
and that the falsetto voice was a form of learned behavior. 

Psychosocial history was positive and helped to explain 
the etiology of his falsetto voice. Dr. L. came from a family 
of modest circumstances, whose parents escaped perse- 
cution in Europe. He described his parents as considerably 
opposite in personality: his mother as highly aggressive 
and verbal, his father passive, silent, and unassuming. Dr. 
L., as far back as he could remember, had witnessed con- 
siderable family arguments and turmoil. During his early 
developmental years, he witnessed violent, uncivilized 
behavior, feeling ashamed over what the neighbors, adja- 
cent to their apartment, would think about his parents’ 
fighting, and often felt like hiding during such conflict to 
avoid being recognized by his neighbors. 

Dr. L. decided to study medicine not only out of his 
intellectual interest in the field but also because he was 
determined to bring some happiness and honor to the 
family, especially in view of his older brother’s troubles 
with drugs and his general failures as a person. Dr. L. com- 
pleted his premedical studies with high grades and felt 
“thrilled” to be in medical school. 

It is important to note that Dr. L.’s entire approach to 
communicating verbally with others was one of compli- 
ance and compromise. He admitted to difficulty con- 
fronting, disagreeing, or arguing with others, preferring 
to take a conciliatory approach except on matters that 
clearly indicated the necessity for him to stand his 
ground. He admitted to a supersensitivity toward others’ 
opinions of him, admitted to a passive approach to peo- 
ple, stated that “confronting creates hostility,” and gener- 
ally feared using speech to express his opinion on issues 
even of a mundane nature. 


Impression 


Mutational falsetto (puberphonia), possibly secondary to 
ambivalence toward confronting others via speech, of 
long-standing duration. 


Recommendation 


Direct voice therapy was begun on the morning of the day 
of his examination, and in a brief period the patient was 
able to produce a completely normal, low-pitched, rea- 
sonably clear voice. He was advised to employ the low- 
pitched voice over the lunch hour and to return during the 
early afternoon. 

Upon returning for continued therapy, and after having 
been asked about his experience with the low-pitched 
voice over the lunch hour, the patient began to express, 





using the low-pitched voice, a high degree of anxiety over 
whether or not he would have the “guts” to return to 
where he was to continue his residency. He complained 
that the low-pitched voice took a considerable amount of 
effort, in spite of the fact that he had been using it for a 
couple of hours without objective evidence of fatigue. At 
that point, a considerable amount of time was devoted to 
discussion of his fears about returning home with his new 
voice. He frankly admitted that he was “afraid—everybody 
is used to my old voice,” and what would he tell people 
back home about the ease with which the low-pitched 
voice was acquired? He even went so far as to suggest he 
fabricate some diagnosis that he could use in describing 
the results of the voice examination, one that would indi- 
cate an organic cause for his dysphonia, and some medical 
or surgical therapy resulting in its improvement. He did 
not like the idea of mutational voice disorder as an expla- 
nation because it carried with it too much of the implica- 
tion of psychogenicity. He was particularly fearful that his 
new residency chief at the university would think that 
there was something peculiarly different or wrong with 
him, even though he admitted that prior to this time he 
had only met this physician three times and on each occa- 
sion the contact was brief. Dr. L.’s concern over his image 
and future acceptance extended to the point of requesting 
that the voice clinician not even write a letter back to the 
referring physician indicating a diagnostic impression, for 
fear that this information would end up in the patient’s 
medical record and would be prejudicial to him. 

In spite of these trepidations, Dr. L. volunteered that, 
despite his misgivings over returning home with the low- 
pitched voice, it was imperative for him to do so, because 
he had become convinced during the treatment sessions 
that the old, falsetto voice could have long-term negative 
effects on his professional and social success. The patient 
was discharged with him using his low-pitched voice and 
was given a tape to record his voice to return to the voice 
clinic in 1 month. 


Postscript 


The 1-month posttreatment tape returned to the clinic 
was of a normal adult male. Dr. L. stated that using his 
new voice was no longer a problem and that to his sur- 
prise few people even questioned the lower sound. 


Case Study 13.6 Psychogenic Adductor Spasmodic 
Dysphonia 


History 


Ms. W. was a 35-year-old black woman who had a congen- 
ital sensorineural hearing loss, for which she wore a hear- 
ing aid, and ocular hypertension. She was referred to the 
voice clinic because of the development of a voice disor- 
der over the prior 3 years. 


Examination 


Ms. W. had intermittent adductor laryngospasms during 
conversational speech but fairly clear voice during vowel 


prolongation. She also had a mild interdental lisp. Motor 
speech examination was normal. 

The psychosocial history was highly positive. The 
patient had a history of psychiatric problems for which 
she had been hospitalized for 1 month. She described her 
psychiatric disorder as a “nervous breakdown” that she 
believed was precipitated by the death of her sister. Dur- 
ing the hospitalization she had psychotherapy, but by the 
time of the voice evaluation she was only seeing a psychi- 
atrist once every 5 to 6 months. 

The patient lived with her other sister, who was a 
teacher. Ms. W. complained bitterly that ever since she 
came to live with her sister, she had been ordered around, 
made to perform household chores that she believed 
went beyond her responsibility, especially since she 
worked all day lifting heavy objects. At one point, her sis- 
ter became physically abusive toward the patient. She 
described her life at home as unhappy, but she was unable 
to move out or talk back for fear of alienating her sister 
further. “She tells me what to do. She speaks for me. And, 
I’m just there to work my head off.” Had she been finan- 
cially able, she would have moved out. (It is important to 
note that as the patient expressed her anger, and inciden- 
tally did so quite competently, her laryngospasms disap- 
peared and her voice became virtually normal.) 

After the interview with the patient, her sister was 
escorted into the office to discuss the situation, the sister 
claiming “I had no inkling she was angry.” The patient’s 
sister defended her behavior by saying that if she didn’t 
remind the patient to perform certain tasks around the 
house, or keep appointments, the result would be confu- 
sion. The sister described herself as the matriarch of the 
family who had gone out of her way to keep her siblings 
together and that, having given 10 years of her life to their 
welfare, she was fed up with family problems, meaning 
that she was not interested in pursuing counseling 
together with the patient to bring about a more equitable 
relationship with the patient in her home. 


Impression 


Psychogenic adductor spasmodic dysphonia, in a patient 
with a history of psychiatric problems who exhibited 
extreme anger toward her sister, was trapped in a domestic 
situation that she found almost intolerable but who could 
neither express her anger nor extricate herself from the sit- 
uation because of her financial inability to be on her own. 


Recommendation 


The clinician’s recommendation was that Ms. W. and her 
sister return together to talk to a psychologist. The clini- 
cian attempted to facilitate this by bringing out Ms. W.’s 
feelings in the presence of her sister thinking it might 
have some ameliorating effect on the situation. The three 
of them discussed what the clinician thought to be the 
cause of the voice disorder. When the sister balked at the 
idea of joint psychotherapy, the clinician recommended 
that Ms. W. return to her psychologist and to explain the 
results of the voice examination that suggested this 
patient needed to talk out her hostilities with the psychol- 


13 Extended Case Studies 291 





ogist even though her sister would not accompany her. It 
was also recommended that the patient return for peri- 
odic visits to recheck her voice and personal status. 


Postscript 


Financial constraints prevented the patient from follow- 
ing the recommendations. On the occasions when she did 
return, her voice was variable. At times her speech was flu- 
ent and without any hint of effort or dysphonia. These 
were times she reported being at peace with her sister. At 
other times she presented with a voice nearly identical to 
that observed in the initial evaluation. This case raises 
interesting questions. Even though her disorder is 
thought to be of psychogenic origin, would the patient 
benefit from any direct symptom management? From 
Botox injections? What is the role of the speech-language 
pathologist when patients fail to follow recommenda- 
tions for psychotherapy? 


Case Study 13.7 Psychogenic Aphonia Masquerading 


as Myasthenia Gravis 


History 


Mrs. W. is a 39-year-old computer operator who has a his- 
tory of 4 years of generalized weakness of the voice and 2 
years of trouble with her speech. Specifically, she describes 
problems with breathing; a weak, low, soft voice; and defec- 
tive articulation. She had been suspected of having myas- 
thenia gravis, although a definite diagnosis had not been 
established. She consumed large does of Mestinon® (pyri- 
dostigmine bromide, USP, Valeant Pharmaceuticals Int., 
Aliso Viejo, CA). She said her speech deteriorated with pro- 
longed speaking, and it also disintegrated when performing 
strenuous activities using other muscles of the body. 


Examination 


Oral-physical examination was essentially normal. She 
may have had a mild degree of soft palate asymmetry 
with a tendency for it to pull up to her right on phonation. 
On speech examination, she had a possible +1 “flutter,” 
and she phonated with a + 1 breathy voice quality. 

On the stress test for myasthenia gravis, the patient’s 
voice deteriorated to a complete whisper by the time 
she had counted to 123, at which time her coup de 
glotte was —3 weak, whereas prior to the stress test it 
was normal. Alternating mean rates (AMRs) of 
diodochokinetic productions of /p/ /t/ /k/ were of nor- 
mal rate and regularity. 

Under medical supervision of a physician and a nurse, a 
placebo dose of 0.9 mL saline was injected. Initially, her 
voice completely deteriorated by the time she had 
counted to 185; at such time 0.2 mL saline was injected, 
and shortly thereafter the voice cleared completely for 
several seconds before returning to a state of aphonia. 
After the injection of the remaining 0.7 mL saline, the 
voice returned to completely normal and stayed that way. 
Her coup de glotte also became normal. 
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Impression 


Psychogenic aphonia masquerading as flaccid dysphonia 
or myasthenia gravis. 


Recommendation 
Consider psychiatric consultation. 


Postscript 


This case is noteworthy not so much for the specifics of 
the psychosocial history but because she seemingly pre- 
sented with such a clear-cut history of a neurogenic disor- 
der. It was only through the injection of the placebo that 
the true underlying nature of her problem was revealed. 


Case Study 13.8 Psychogenic Adductor Spasmodic 
Dysphonia 


History 


Mrs. B. was a 58-year-old, married farm wife who gave a 
15-year history of episodic aphonia and dysphonia peak- 
ing during the prior 10 months in a continual state of 
aphonic-dysphonic voice. She had been treated in the 
past by local doctors with voice rest. She had been under 
the impression that her voice episodes stemmed from 
sinus drainage. 


Examination 
+ ENT examination revealed normal vocal folds. 


Oral examination was normal except for some sugges- 
tion of lower facial weakness or asymmetry on the 
right, which, by itself, was uninterpretable. Other 
speech structures were judged to be normal. 


è +3 musculoskeletal tension and pain in laryngeal- 
hyoid region 

@ +2 strained-hoarseness (patient said her voice was 
better since her throat was “sprayed in ENT”) 


+ Psychosocial: The only problem area that appeared 
causally related to the dysphonia of recent origin was 
that the patient had a falling out with her sister over 
the latter’s refusal to help care for their 82-year-old, ail- 
ing mother. The patient cried when discussing this sub- 
ject: she was hurt, disappointed, frustrated, and addi- 
tionally upset that she found it hard to talk with her 
mother over the phone because of her dysphonia and 
her mother’s hearing difficulties. 


Impression 
Adductor spasmodic dysphonia, psychogenic. 


Therapy 


Catharsis plus laryngeal manipulation to reduce muscle 
tension yielded ~20% voice improvement. It was stressed 
that the patient divest herself of the idea that she needed 
to rest and protect her voice and that she begin to think of 





psychologic rather than physical explanations for her dys- 
phonia. An additional therapy session was scheduled for 
the next day. No improvement was seen over the initial 
session. She exhibited moderate adductor spasmodic 
dysphonia. She was having trouble accepting a nonor- 
ganic explanation for her dysphonia. The clinician tried to 
honestly explain the basis for interpretation of her case. 
The patient was judged to not be a very self-searching 
type of person and was unable to amplify on her emo- 
tional problems. Even her admittedly normal voice for 2 
days, while on vacation in Hawaii, she attributed to the 
weather. The session concluded with the advice she use 
the voice as much as possible and to return for additional 
therapy. 


Postscript 


With several additional sessions of catharsis plus laryn- 
geal manipulation to reduce muscle tension, further 
voice improvement was achieved but consistently nor- 
mal voice was not. She claimed she was able to talk with 
her mother on the telephone and no longer feared that 
voice rest was necessary to preserve her voice. This is 
another instance where if her voice were not meeting her 
social and occupational needs, some clinicians might rec- 
ommend Botox to supplement behavioral management. 
Botox was not recommended in this instance because the 
patient did not wish to exchange periods of breathiness 
for her now mild-moderate episodic periods of adductor- 
spasmodic dysphonia (AD-SD). Moreover, she felt her 
voice was adequate for her needs. Psychotherapy was not 
pushed because she resisted the connection and did not 
appear to be able to be self-reflective. 


Case Study 13.9 Psychogenic Hoarseness 


History 


Mrs. F. is a 28-year-old married woman referred by oto- 
laryngology for evaluation of a voice disorder and throat 
pain. Her voice troubles go back 3 years when she lost her 
voice for 5 days and then again for another 3-week period 
during which time she “whispered.” Her latest bout with a 
voice disorder began with a cold and hoarseness. She 
became aphonic (whispered speech) for 1 month; her 
voice improved and then got worse again. A local oto- 
laryngologist advised her to speak in a higher-pitched 
voice as a means of treatment. 


Otolaryngology 
Examination was negative. 


Voice Examination 


The patient’s voice was mildly hoarse at times during the 
examination, but for the most part her voice quality was 
normal and there was little to indicate a voice disorder. 
However, she had +2 musculoskeletal tension pain in the 
left laryngeal hyoid region to palpation, which the patient 
said spontaneously would migrate to her ear. 


The psychosocial history was positive. Gradually 
throughout the examination, it became apparent that 
this sensitive, moderately religious woman was in conflict 
with herself about a perplexing and anger-producing situ- 
ation that had developed between her and her sister over 
the past few years. Her sister, whom the patient “loves 
very much,” began to have marital problems and extra- 
marital affairs that violated the patient’s and the rest of 
the family’s basic religious convictions. Recently, her sis- 
ter’s marital situation ended in divorce. However, what 
disturbed the patient most of all was that her sister was 
angry at her for not “supporting” her sister’s decision to 
divorce. Mrs. F.’s sister had accused her of being self-right- 
eous and lacking in understanding. The patient had taken 
these accusations to heart and had felt “sad, angry, and 
guilty” because of her objections to her sister’s behavior. 
The patient appeared in conflict because she perceived 
that her sister, who was barely on speaking terms with the 
rest of the family, wanted her verbal support for what she 
had done, and the patient could not find it in her con- 
science to give that approval. It was interesting to note 
that the night before she lost her voice, most recently she 
had had a depressing conversation with her mother about 
her sister in which the patient reiterated some of her sis- 
ter’s accusations. It was also interesting that the patient 
said, “kiddingly,” “after | lost my voice | compared myself 
to a Biblical character who also lost his voice after some 
kind of conflict.” The patient lost her voice the night that 
she had had the conversation with her mother. 

Exacerbating the situation was the obviously iatrogeni- 
cally damaging advice that she received from her oto- 
laryngologist that she should raise the pitch of her voice. 
Assuming the patient’s report on her voice before and 
after raising the pitch was correct, the advice had a con- 
siderable detrimental effect on voice production. The 
high pitch production increased her musculoskeletal ten- 
sion to the point of obliterating her voice and increasing 
her pain. 


Impression 


Psychogenic hoarseness and psychogenic musculoskele- 
tal tension pain in the laryngeal-hyoid region. 


Recommendation 


The clinician and patient had an extensive discussion 
about how the patient appeared to have taken her sister’s 
accusations seriously, blaming herself rather than looking 
at her own rights to her opinions about the family situation 
more objectively. Mrs. F. seemed to have developed some 
understanding of the relationship between her being 
upset with her sister and her voice disorder and throat 
pain. It was decided that she should talk over with her hus- 
band an alternative point of view, namely that it is not she 
who was to blame for her sister’s problems and that per- 
haps she ought to express her disapproval more openly 
without feeling guilty about it. She was also advised that if 
she still felt the situation were unresolved and, particularly, 
if her voice problems and throat pain did not abate, she 
should call for a referral for psychologic counseling. 
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Additionally, the patient was advised to disregard any 
advice pertaining to changing the way in which she uses 
her voice, to use it at a normal pitch and loudness, and not 
to regard her vocal folds as particularly vulnerable. 


Postscript 


During a follow-up call, the patient demonstrated an 
overall improvement in her voice and throat discomfort. 
She said that she appreciated the results of the consulta- 
tion and desired a referral for counseling. 


Case Study 13.10 Psychogenic Adductor Spasmodic 


Dysphonia 
History 


A 38-year-old waitress-bartender had had three episodes 
of abnormal voice during the past 5 months. The first 
episode occurred suddenly and lasted a week before the 
voice returned. During the second episode, the voice 
remained abnormal for 2 months before remitting. The 
third episode began 2 weeks prior to the voice evaluation. 
She reported the symptoms were variable. At times she 
was unable to get any voice out, and at other times it 
came out sporadically. Her throat felt swollen inside. 

Six days after the onset of the last episode, she con- 
sulted a local physician, who suspected a polyp on the 
basis of her case history but did not complete an indirect 
laryngoscopic examination. 


Laryngologic Examination 


Vocal folds were clear, fully mobile, with no evidence of a 
polyp, but a tiny web was visible in the posterior commis- 
sure. ENT exam was otherwise normal and her overall 
health considered good. 


Voice Examination 


This was a pleasant, smiling young woman who interacted 
easily with the examiner and exhibited intermittent 
adductor voice arrests with intermittent breathiness. 
Vowel prolongation was normal. Musculoskeletal pain 
tension was evident on the left side of the neck over the 
hyoid bone and thyrohyoid space. Palpating from the 
thyroid notch across the superior border of the thyroid 
cartilage, it was determined that there was excess approx- 
imation of the thyroid cartilage to the hyoid bone. 

The psychosocial history was positive. More or less tear- 
fully throughout the examination probing the psychoso- 
cial background, the patient disclosed three areas of her 
life that had produced affective changes but which she 
indicated she had tried to avoid becoming concerned 
about: (1) her impending divorce after 20 years of mar- 
riage and her children’s rejection of her current boyfriend; 
(2) an automobile accident within the recent past that 
killed three of her close girlfriends; (3) an unjust situation 
at work in which she was not being paid for the work that 
she was doing in place of her workmate’s maternity leave. 
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In greater explanation of the above, the following 
details emerged. The patient had been married for 20 
years to an alcoholic gambler who has had marital infideli- 
ties. One and a half years ago, she walked out. She would 
have done so sooner, but what precipitated her finally 
deciding to do so was having fallen in love with another 
man, which gave her the courage to leave. This situation 
was very difficult for her, particularly because of her chil- 
dren, ages 2, 18, and 19, who stayed with their father. 
Since her departure, her children have been angry at her, 
although her communication with them has improved 
over time. Nevertheless, the children blamed her 
boyfriend for the marital breakup and would not have any- 
thing to do with him. This situation was most upsetting to 
the patient who wanted an amicable three-way relation- 
ship. The conflict was most distressing to the patient 
because it prevented her from seeing her children at the 
same time she saw her new male friend forcing her to 
choose between the two. 

The patient feels guilty about her girlfriends who were 
killed in the accident, retrospectively wishing that she had 
spent more time with them. At first she said they were 
killed a few months after the onset of her voice disorder, 
however, later she admitted that she had forgotten when 
the accident took place and thought in actuality it might 
have occurred in closer proximity to the time surrounding 
the onset of her voice disorder. 

By personality, the patient, lifelong, has always sup- 
pressed her feelings rather than express them openly. For 
example, she allowed herself to be taken advantage of at 
work in not being paid for the work that she had to do in 
place of her fellow employee who went on maternity 
leave. By rights she should have complained about this sit- 
uation to her boss, but she was unable to muster the 
courage. Similarly, she had never stood up to her hus- 
band, fearing physical abuse. Instead of arguing with him 
when he would perpetrate a wrong pertaining to his 
drinking and other objectionable activities, the patient 
“just cried.” Allin all, the patient said, “It had been a crazy 
year.” 

It is interesting to note that throughout the disclosure 
of her history, the patient continuously smiled through 
her tears, and it was clear from her behavior and her state- 
ments that she has kept all of her problems to herself and 
tried to maintain a cheerful exterior. 

During the course of her disclosures concerning her 
personal problems, her voice improved ~50%. It improved 
another 50% with combined musculoskeletal tension 
reduction in the laryngeal-hyoid region. 

Finally, the clinician and patient talked about a possi- 
ble avenue of remediation of the conflict between her 
older children and her new male friend, namely, to 
obtain conjoint counseling at home during which all par- 
ties could rationally come to an understanding of one 
another. Moreover, it was suggested that the patient 
could use some individual counseling for the purpose of 
enabling her to express her feelings more extensively rel- 
ative to the long history of her marital problems and the 
incompletely expressed grief over the death of her 
friends. She promised to call in ~2 weeks and to come in 





for a recheck if matters were not coming along as 
desired. 


Impression 


Psychogenic, probably conversion, adductor spasmodic 
dysphonia. 


Postscript 


Spasmodic dysphonia is a disorder diagnosed by auditory 
perception of symptoms. Generally, this is a neurologic 
condition treated with a combination of voice therapy 
and medical/surgical treatment such as botulinum toxin 
injections. This is a case of an individual who exhibited the 
acoustic symptoms of adductor spasmodic dysphonia 
and a psychosocial history consistent with a psychogenic 
etiology. The successful combination of laryngeal re-pos- 
turing and talking therapy confirmed the diagnostic 
impression. It is important for clinicians to recognize that 
the aberrant voice characteristics resulting from excessive 
laryngeal muscle tension are symptoms, not etiology. 
Integration of information from examinations with care- 
ful case history-taking and response to vocal probes is 
essential to making the appropriate diagnosis and plan- 
ning treatment. 


Case Study 13.11 Psychogenic Aphonia-Dysphonia 


History 


Mrs. M. was a 49-year-old housewife and mother of two 
grown sons who was referred by internal medicine for 
evaluation of a voice disorder that had existed for 1 
month. Its onset was rapid, and although it had fluctuated 
in severity during the past month, it had never returned to 
completely normal. The patient did not complain of any 
chewing, swallowing, or drooling difficulties. ENT exami- 
nation was normal. 


Examination 


The motor speech examination failed to disclose any evi- 
dence of a dysphonia-related dysarthria. Her voice during 
contextual speech was characterized by breathiness that 
at times was constant and at times intermittently voiced. 
She was nearly aphonic. Her coup de glotte was relatively 
normal, and she had increased musculoskeletal tension 
and pain in the laryngeal-hyoid region. 

During the examination while testing for the muscu- 
loskeletal tension and pain by palpating her laryngeal- 
hyoid region, her voice became normal. 

The psychosocial history was positive. The patient 
described herself as a “workaholic” who had an almost 
religious need to work hard the same way her mother did. 
She said she was nearly always up at 5 o'clock in the morn- 
ing and worked all day long cleaning, cooking, and serving 
her family. 

She had been particularly upset the past 3 months 
because of fatigue and chest pain, crying as she described 


her frustration and anger at herself for not being able to 
continue her driving need to work and feeling guilty about 
being sick. 


Impression 
Psychogenic aphonia-dysphonia. 


Recommendation 


This patient’s voice remained nearly normal throughout 
most of the examination once it broke through during 
testing of her musculoskeletal tension in the neck. The cli- 
nician and patient discussed the excessive demands that 
she made upon herself and her inability to reduce her 
work output when she was not feeling physically up to par. 
They talked about the need for her to be less demanding 
of herself and its importance to the maintenance of 
improved voice and also about the possibility that it might 
benefit her fatigue and chest pain, presuming no organic 
findings for these complaints. The patient appeared to 
understand the relationship between fatigue, muscu- 
loskeletal tension and pain, her voice loss, and her anger 
and excessive demands that she had been making upon 
herself. This case required psychiatric consultation and 
recommendations for improved self-understanding and 
modification of behaviors. 


Postscript 


These cases always raise questions as to why work 
becomes too much for an individual at a particular point 
in time. Is it a cumulative process? Is it because she was 
consumed about worry of her health? Did she have upper 
respiratory infection (URI) initially for which she did not 
allow herself to properly recover before moving full steam 
ahead in working for her family? Did she have the typical 
“doormat” personality allowing all of her family to walk 
over her in spite of feeling fatigue and concerned about 
chest pains? Often, clinicians do not get the answer to all 
of these questions in taking a case history. What they do 
get is red flags of areas of concern that raise the ques- 
tions. Even though the voice is often relatively easy to 
return to normal, the underlying causal problems most 
often do not, and require professional help from someone 
skilled in working through psychologic issues. 


Case Study 13.12 Psychogenic Mutism and Aphonia 


History 


A 39-year-old nurse’s aide who worked at a nursing home 
and carried a diagnosis of “multiple sclerosis” was diag- 
nosed with “ataxic dysarthria” and was referred for 
speech reevaluation because of a complete loss of speech 
of 1% weeks duration. Although the chronology of her 
speech loss was unclear, what she seemed to be saying 
was that she went to her local physician because of her 
“slurred speech.” The physician then set up a reexamina- 
tion but shortly thereafter, before the scheduled reevalu- 
ation, her speech deteriorated to mutism. Except for brief 
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periods of normal speech, her mutism persisted over the 
1% weeks prior to the evaluation. 


Examination 


The patient entered the examination without any ability 
to speak whatsoever, laboriously writing notes as a means 
of communication (the patient complained that it was 
more difficult for her to write since the onset of her “mul- 
tiple sclerosis” making her mutism doubly difficult). 

The oral-physical examination showed normal symme- 
try and strength of all of her speech musculature. Despite 
inability to voluntarily phonate for speech, she was able to 
adduct her vocal folds strongly, producing a sharp coup de 
glotte and prolonged Valsalva maneuver at the laryngeal 
level. Her cough was normal. Velopharyngeal, tongue, 
and lip strength were normal. Yet, upon asking her to pro- 
duce alternate movements of her tongue laterally it 
became fixed slightly off the midline, quivering in that 
position. Attempts to produce AMRs for p-t-k resulted in 
complete, tonic blocking involving her labial and lingual 
musculature, and she was unable to produce any of these 
syllables. She was able to prolong a vowel in a very breathy 
voice. 

Within 15 minutes, the patient was talking normally, 
having been led from isolated sound production to sim- 
ple, familiar words. Beginning with counting, she pro- 
gressed from counting a few single digits to several, her 
phonation and articulation becoming stronger as she did. 
Then, transitioning to contextual speech upon answering 
questions, she began to speak but in a “telegraphic” man- 
ner, that is, devoid of articles of speech, which rapidly 
began to fill in until her syntax and grammar were normal. 
Also, as her speech emerged, she had tonic stuttering-like 
blocking on the initial phonemes of words. 

The psychosocial history taken from the patient was 
nonproductive. Despite having been asked four times as 
to whether or not there were any personal problems that 
might have been upsetting her on or about the time she 
became mute, she steadfastly denied that there were any 
such problems. However, near the end of the examina- 
tion, her daughter was invited into the room, who sug- 
gested there had been some potentially emotionally 
upsetting events that had taken place recently. (1) The 
patient’s 21-year-old son, who had been a serious behav- 
ior problem by age 17, suddenly left home and the family 
had not heard from him for 4 years until they tracked him 
down through Social Security and contacted him 2 
months prior to the onset of her mutism. He agreed to 
come home but then canceled, although promising to 
come a month later. (2) About the same time, the 
patient’s parents decided against traveling to witness 
their granddaughter’s graduation, which hurt the patient 
considerably. These events had occurred within the 
framework of a person who has admittedly been “raised 
not to show anger.” Her daughter said that when her 
mother became angry over these events, she shut herself 
in the bedroom behind a closed door. Despite the daugh- 
ter’s interpretation of these events, the patient continued 
to deny that events surrounding her son and her daugh- 
ter’s graduation had angered her. 
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Impression 
Conversion muteness. 


Recommendations 


(1) Psychiatric consultation. (2) If dysphonia, aphonia, or 
muteness recur, the patient should be seen for reevalua- 
tion and symptomatic therapy, not just with the idea of 
relieving her speech symptoms but to establish a better 
rapport that might lead to easier compliance with psychi- 
atry or psychology, if this should become a point of resist- 
ance. 


Postscript 


Initially, the history of MS, dysarthria, abnormal tongue 
movements, and negative case history presented by the 
patient might lead clinicians to conclude her mutism was 
neurologically based. However, the astute clinician would 
see the inconsistencies in the examination that suggest 
the mutism was voluntary. For a variety of reasons, such 
as privacy and access, bringing a relative into the session 
to augment the case history is not always possible. How- 
ever, when a clinician suspects the patient is not able to be 
forthright, and the patient does not object to another per- 
spective, observations from a significant other can be illu- 
minating. The fact that the patient continued to deny her 
feelings suggested that she was not too introspective or 
not willing to express her feelings at the time of the exam- 
ination. This could also be interpreted as a sign that she 
would not seek psychologic counseling, which indeed 
turned out to be the case. In these instances, when the 
patient returns for reevaluation or treatment, the speech 
clinician can often gain sufficient trust to help the patient 
understand the importance of dealing with the psycho- 
logic issues to prevent recurrence and lead a better life. 


Case Study 13.13 Adductor Spasmodic Dysphonia 


History 


Mrs. C. was a 62-year-old married woman who had experi- 
enced the onset of hoarseness 2 years after a severe bout 
of influenza. Her dysphonia worsened and she was even- 
tually diagnosed as having “spastic dysphonia.” She had 
eight voice therapy sessions, which had mild supportive 
benefit but did not alter her dysphonia. She gave no fam- 
ily history of a similar voice trouble or tremor. ENT exam 
was normal. 


Examination 


+ Contextual speech was characterized by moderately 
strained, harsh dysphonia. 


+ Vowel prolongation produced intermittent adductor 
laryngospasms, adventitious mandibular movements, 
blepharospasm, adventitious finger movements, and 
possible mild head tremor. 


@ Psychosocial history was devoid of any significant emo- 
tional stress, acute or chronic. 





Impression 


Adductor spasmodic dysphonia, dystonic (neurologic 
movement disorder) confirmed by neurology. 


Therapy 


Discussed. No therapy known with certainty. Not a candi- 
date for recurrent laryngeal nerve (RLN) surgery. Discus- 
sion of undergoing neurologist’s trial therapy with 
lithium. Patient wished to defer decision. 

Patient was asked to see speech-language pathologist. 


Family History 
None known for tremor or other neurologic disease. 


Personal History 


Two years prior to examination, the patient apparently 
had neck pain and stiffness, which was treated with cervi- 
cal traction with good result. She otherwise appeared in 
good health, on no medications. Approximately 2 years 
ago, the patient experienced a flu-like illness with fever, 
nausea, vomiting, and diarrhea. She became so dehy- 
drated that it required 3 days’ hospitalization with an IV. 
She recalled no treatment medications. Shortly after her 
hospitalization, she became aware of a raspy quality to 
her voice. Two or 3 months later, it was clear that she had 
a voice problem, which was diagnosed as spastic dyspho- 
nia. She believed her symptoms had been stable for ~1 to 
1% years. Vocal exam was characteristic of adductor spas- 
modic dysphonia; she had strained, strangled voice with 
frequent voice stoppages. The speech pathologist con- 
curred with the neurologist that there were observable 
slight fidgety movements of the digits of the hands at 
rest, and as Mrs. C. maximally sustained a vowel, quick 
transient blepharospasm-like movements about the eyes 
and quick movements about the chin with slight lateral 
jaw movement also were observed. There was good range 
of motion of the neck and no reduction in tongue, hand, 
or foot AMR. This clearly was organic, and there seemed 
to be a mild associated horizontal intermittent head 
tremor. There was no hand tremor evident. The speech- 
language pathologist, ENT specialist, and neurologist all 
concurred that surgical treatment should be deferred and 
medicinal therapy could be instituted. A battery of tests 
was conducted including Minnesota Multiple Personality 
Inventory (MMPI), hematology group, chemistry group, 
urinalysis, and 24-hour urine collection of methoxy- 
hydroxy-phenyl-glycol (MHPG). Electrocardiogram (ECG) 
studies were recommended to get a clearer picture of 
organicity. The speech-language pathologist and neurol- 
ogist discussed their lack of success with Inderal with or 
without Klonopin® (clonazepam, Hoffmann-La Roche, 
Co., Nutley NJ) and raised the possibility of using lithium. 


Postscript 


At the time of presentation, the symptoms of spasmodic 
dysphonia were mild and, according to the patient, had 
been stable for 12 to 18 months. Consequently, she decided 
to defer any treatment. The basis for the neurologist’s 


suggestion of lithium was that he had experience with one 
case of spasmodic dysphonia who had improved psychiatri- 
cally and vocally on lithium. Lithium had been a regimen 
successfully used in other movement disorder patients, and 
because this patient had spastic dysphonia, blepharospasms, 
and twitchy digits indicative of a movement disorder, the 
treatment seemed reasonable. The regimen used was 
lithium carbonate, one-half tablet of a regular 300 mg tablet 
three times per day. Measures of lithium levels were made 
after 3 days and 6 hours after the last orally ingested dose. 
Measures were then used to titrate the lithium appropri- 
ately. Later, to block tremors if they developed, Inderal was 
added as 10 to 20 mg three times daily and/or a tricyclic or 
tetracyclic depending upon the MHPG result. 


Case Study 13.14 Psychogenic High Pitch 


History 


Agirl aged 7 years 10 months was referred for a voice eval- 
uation by her local speech pathologist because of an 
abnormally high pitch and unintelligible speech. ENT 
exam was normal. Psychosocial history was provided by 
the child’s mother, the speech pathologist accompanying 
mother and child to exam, and from the medical history. 
At 2% weeks of age, the child was placed in a foster home 
because of abuse and an injury to her leg. Her parents 
were divorced when she was ~4 to 5 years of age. 
Recently, her mother and boyfriend decided for change 
and at the beginning of the school year moved to a new 
city. Shortly thereafter, the patient was seen for a speech- 
language evaluation. 

Evaluation by the speech-language pathologist 
revealed that the patient’s voice was unusually high pitch, 
her receptive language was within low normal/average 
range (Peabody Picture Vocabulary Test and Test of Lan- 
guage Development test), and speech was intelligible. 
Hearing was normal. The child did not interact much with 
other schoolkids. Since the speech-language patholo- 
gist’s evaluation, the child’s voice had progressively 
increased in pitch, speech had become less intelligible, 
and she had developed an increased use of gestures. The 
speech-language pathologist also reported that she inter- 
acted more with other schoolkids, as a group of them 
began following her around and “talking for her.” In the 
past month, her voice had significantly changed (higher 
pitch), and over Christmas vacation she had lost her voice 
completely for a few days when she had a URI. The 
speech-language pathologist reported that no time since 
the autumn had voice improved or been normal. Mother 
described the child as quiet, shy, and having become 
more withdrawn over the previous 6 months, during the 
summer interacting and playing less with children, even 
avoiding them when they came to play. She said the child 
has been less than affectionate or attentive with the new 
baby and almost disregards it. Over the summer, she had 
an incident of self-wetting and while upset said to mom, “I 
have a rough life.” Although mother described her as 
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“sensitive,” she was unable to give further detail or exam- 
ple. Apparently, the child had shown little concern about 
her inability to talk more effectively. In speech therapy at 
school, no change had been effected in the child’s voice 
with a variety of therapy approaches. 


Examination 


+ Oral mechanism Structures symmetric, and + normal 
in strength and movement. 


+ Articulation + normal (/f/ for Ja] substitution). 


+ Motorspeech Excess musculoskeletal tension was evi- 
dentin lips, cheeks, neck, and shoulder muscles during 
speech tasks, which included counting, sustained /a/ 
(~3 seconds), and contextual speech. Voice was vari- 
able; pitch was abnormally high; quality was harsh and 
breathy; cough and glottal coup were weak but of 
lower pitch than observed during speech tasks. Palpa- 
tion of laryngeal structures revealed an elevated 
larynx, mobile structures, excess neck strap muscle 
tension, and complaints of discomfort lateral to laryn- 
geal structures. Voice remained high-pitched through- 
out a variety of tasks. After practice and two to three 
successes, the child was able to blow. Under conditions 
of white noise masking binaurally at 80 dB, no change 
in either voice pitch or increased loudness was 
observed. When the patient was positioned supine 
with neck and head tilted backward to lower the larynx 
and prevent volitional elevation, inconsistent, inter- 
mittent seconds of change, lower pitch, but mostly 
persistent high pitch were observed. It was also noted 
that the patient was somewhat resistive to this task. 


Impression 


Psychogenic voice disorder of abnormally high pitch that 
could not be consistently altered during exam or with 
symptomatic treatment. During attempts to get voice 
change, the child generally smiled and appeared to put 
forth little effort suggesting the high-pitched voice may 
be volitional or elective. 


Recommendation 


Psychiatric consult and/or therapy. Speech therapy was 
not warranted at this time because of the child’s apparent 
resistance to change and probable associated gains from 
voice disorder (i.e., attentional gains). Referred patient 
back to pediatrician to follow up with psychiatric consult 
and asked the local speech-language pathologist to set up 
a follow-up appointment after psychiatric evaluation 
and/or therapy. Findings were briefly discussed with 
mother. Numerous family dynamics and situational stres- 
sors are likely relevant to the etiology of this child’s voice 
disturbance. Arrangements were made for appropriate 
psychotherapy in their local community. 


Follow-Up Speech Evaluation and Treatment Session 


Avideotape was made of the patient followed by play ther- 
apy. Speech was observed to be less intelligible and voice 
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of higher pitch than noted in previous evaluations. Several 
times during video taping and play, she cleared her throat 
in lower pitch prior to speech initiation. During play inter- 
actions, the child manipulated conversation by facial 
expression and gestural communication. She avoided 
answering open-ended questions with no response or 
shoulder shrugs and primarily used gestures and sophisti- 
cated facial expressions and pointing, and so forth, to 
communicate, with apparent avoidance of verbalization. 
The speech-language pathologist was unable to rule out 
conversion reaction as a psychogenic voice disorder factor 
and was unable to elicit consistent normal voice. 


Child Psychiatric Evaluation 


This 7-year 10-month youngster was referred from ENT 
and speech pathology, who share the conviction that the 
patient’s voice disturbance is psychogenic in origin. The 
psychiatric assessment included a 45-minute interview 
with mother and her common-law husband, Mr. Z., a 30- 
minute playroom evaluation, and a 15-minute summariz- 
ing discussion with the parents together. 


History of the Current Illness 


As well-documented elsewhere in the record, the patient’s 
voice disturbance began subtly and slowly with mild eleva- 
tion in pitch during the summer, which had gradually pro- 
gressed to an extremely high pitch, virtually unintelligible 
voice pattern, decreased spontaneous speech, and 
reliance upon gesture and other forms of nonspoken com- 
munication. Speech therapy had been tried but was unsuc- 
cessful. The school speech-language pathologist expressed 
concern about the social impact of the patient’s speech on 
her peers and family adjustment. At school, other children 
have come to treat the child in a very solicitous fashion: 
“speaking for her” and doing other things on her behalf 
(e.g., helping her on and off with her coat, etc.). 

Mother is aware that family and individual psychologic 
factors may be contributing significantly to her daugh- 
ter’s physical symptoms. 


Family and Developmental History 


Mother candidly acknowledged major adjustment con- 
flicts in her family of origin; attributable in part to her 
father’s chronic alcoholism about which mother was 
chronically embarrassed and blamed herself. 

After marrying her husband at age 19, she found him 
also to be an alcoholic and intensely emotionally abusive 
(plus there was question in the record about the child hav- 
ing sustained a traumatic fracture in early infancy). By the 
time the patient was age 2, her parents had separated, 
and a divorce was finalized the following year. Subse- 
quently, her father had paid no child support, had shown 
virtually no interest in her, and had had no contact with 
her whatever during the past 2 years. 

When the child was nearly 4 years old, Mr. Z. began dat- 
ing the patient’s mother on a regular basis, and he began 
living with mother and child on a regular basis. An unin- 
tended pregnancy eventuated in the birth of their son, 
and long-standing intentions to be married had been 





recently precluded by financial limitations resulting from 
Mr. Z. being laid off from his 15 years of employment 
when the business changed ownership; and mother real- 
ized that marriage meant the loss of her Aid to Families 
with Dependant Children payments, which had helped 
her acquire her licensed practical nurse training. However, 
both affirmed their intentions to eventually be married 
and their current perceptions of “functioning as a family 
already in every other respect.” They expressed appropri- 
ate concerns that the child’s biological father might 
oppose any effort to change her name through legal 
adoption, and in the event of a marriage, they are under- 
standably worried that the child would “feel left out” to be 
the only family member to retain the other family name. 
The worsening of the child’s voice disturbance since the 
birth of her half-sibling has raised questions about her pos- 
sible conflicts regarding her mother’s pregnancy and the 
birth of a sibling. However, her mother indicates that the 
child showed no particular distress during the pregnancy 
nor did she display any apparent adverse reaction to the 
baby. The only question the mother recounted was explic- 
itly directed to “where babies come from?” which mother 
acknowledged left her flustered and “not knowing the 
right thing to answer.” In fact, she had offered the explana- 
tion that babies “start from a seed” and before birth are “in 
their mother’s stomach”; very commonplace explanations 
that all too frequently lend themselves to distortions con- 
cerning fantasies of oral impregnation (which one might 
speculate could underlie the child’s compulsive “throat 
clearing” habit). However, it appeared that several other 
issues were perhaps more problematic for the child’s inner 
sense of emotional security and interpersonal conflict. 


+ Since the child’s birth, the maternal grandmother has 
been vigorously involved in the patient’s upbringing. 
After her marital separation, mother returned home to 
live with her parents; and out of practical necessity, 
much of the child’s care was entrusted to grandmother 
as mother was working and attending school. Over the 
years, maternal grandmother’s influence had become 
increasingly a point of contention, with grandmother 
indulging the patient’s every whim for material posses- 
sions and directly undermining mother’s efforts at 
establishing reasonable discipline. For example, grand- 
mother would confront mother in front of the patient 
that “this is my house, and child can do whatever she 
wants to when she’s here.” Mother acknowledged 
extreme difficulty communicating with her mother 
directly, as grandmother took even the slightest criti- 
cism so much to heart as to swing to the opposite 
extreme of self-denigrating comments and the asser- 
tion of intent to have nothing to do with the child in the 
future. Despite mother’s dissatisfaction with grand- 
mother’s influence, the child has continued to spend 
overnights every second weekend with grandmother 
until the family’s recent move. It seemed likely that the 
child had been attuned to the conflict between mother 
and grandmother over the years, had struggled with 
feelings of divided loyalties in her desire to “not take 
sides” with either adult, and may likely have felt “cross- 


examined” by mother’s questioning upon each occa- 
sion when she returned home from her grandparents’. 
The significant decrease in contact between child and 
grandmother since the family moved may have been 
internalized by child, the significance of which would 
require further psychologic exploration. 


+ Although mother is uncertain of the date when child’s 
comments and behavior raised concern that she might 
have been sexually molested by a male in the household 
of a babysitter, it may play a causal role. More specifi- 
cally, her play with dolls seemed more sexually explicit, 
she tearfully refused to return to the babysitter, and 
alluded to a man there having “hurt her.” In mother’s 
retrospective account, when child was interviewed by 
Social Services, they “couldn’t get a word out of her.” 


+ The child’s pet cat ran away, a major loss of an important 
attachment about which the child was still frequently 
sad and voiced despair that mother’s postponement in 
getting a new cat would “be put off forever.” 


Individual Evaluation of Patient 


In playroom interaction, after psychiatrist explained his 
function as “helping kids with their worries,” the child 
made clear (through nonverbal gestures and a few very 
high-pitched, almost unintelligible words) that her great- 
est upset was because of her “cat” having run away. As he 
introduced other issues for her consideration, she acknowl- 
edged being aware of the conflict between mother and 
grandmother, stated that her parents are primarily “wor- 
ried” about her current voice problem (in contrast with 
other emotions of being “sad, angry, or puzzled,” which 
the psychiatrist had listed as alternative suggestions), and 
seemed quite content with the issue of her baby brother, 
despite having described him as “fat” and stating her wish 
that he had been “a girl.” 

Clinically, the psychiatrist was impressed by the child’s 
interpersonal demeanor; her initial reticence giving way 
to a coquettish, bantering, playfully bemused and teasing 
style in the use of her voice disturbance in their interper- 
sonal interaction. She would frequently giggle, cover her 
mouth with her hand, reveal a faint smile, and then act as 
though she were just about to say something important 
before stopping herself prior to any vocalization while 
watching attentively for his response. 

At the conclusion of their interaction, he asked the child 
directly if she would want her voice to change back to nor- 
mal; she unhesitatingly responded, “No!” 

The psychiatrist was reminded of the mother’s summa- 
rizing comment in which she had typified the child’s per- 
sonality as “being unable to get a word out of her” long 
before her current voice problem began. 


Diagnostic Impression 


Somatoform disorder (psychogenic voice disturbance) 
secondary to multifaceted adjustment conflicts. 


Recommendations 


Ideally, a psychodynamically oriented play therapy would 
be recommended to further explore the child’s percep- 
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tion of family dynamics. However, under circumstances of 
her current geographical location and limitations of fam- 
ily resources, recommendations are for counseling to be 
arranged with her school psychologist or through other 
community resources. Mother gave verbal permission for 
psychiatrist to contact a speech-language pathologist 
and Human Services. The specifics of a treatment plan 
had yet to be determined; mother knew a psychiatrist 
remained available as needed. 


Postscript 


This very complicated case history demonstrates how a 
speech-language pathologist and psychologic services 
interact to get a more complete history. It also shows how 
much more detail is obtained by the psychiatrist, how 
patients may not always be reliable information providers, 
and that not every patient is motivated to change. It is 
critical that clinicians learn the danger signs, the signs 
that the problem is deeper than can be handled with 
direct symptom management alone, and the signs that 
no change is likely to take place without psychotherapy. 


Case Study 13.15 Muscle Tension Dysphonia 


History 


This patient is a 54-year-old female hospital administrator 
who saw an otolaryngologist after 3 weeks of “hoarse- 
ness” following “catching a cold.” 

Otolaryngologic examination produced the following 
statement: “Appearance of vocal folds okay but they do 
not move properly. No inflammation. No tumor. No ulcer. 
Nose, negative. Neck, negative.” 


Impression 


“Functional aphonia. Refer to speech pathologist.” 

The speech pathology examination began with a his- 
tory that corroborated the onset of a bad cold 3 weeks 
prior to coming to the clinic. During this period, the 
patient claimed she coughed a great deal and stated 
that after the coughing episodes, “I lost my voice and | 
just haven’t recovered it.” The history revealed that she 
had been having such voice losses since her 20s lasting 
2 to 3 days. She was now unable to work at her adminis- 
trative duties because of her voice. She said her work 
had always been especially tense and that she felt 
exhausted. 


Examination 


Her voice was practically aphonic (i.e., whispered with 
momentary squeaks and squeals). She had mild sen- 
sitivity to pressure in the region of the suprahyoid and 
infrahyoid musculature. With manual circumlaryngeal 
manipulation, she was able to achieve some voice. 


Impression 
Psychosomatic (musculoskeletal tension) reaction. 


300 Clinical Voice Disorders 


Recommendation 
Voice therapy. 


Therapy Summary 


Session 1: Using manual circumlaryngeal manipulation 
and talking voice therapy produced ~75% improve- 
ment over a 30-minute period. She transitioned from 
aphonia to a spastic or spasmodic-like dysphonia. 


Session 2: Continuation of a spastic, strained voice, 
although better than in session 1. 


Session 3: The voice continued to improve but still con- 
tained evidence of laryngospasm. 


Session 4: The patient disclosed that she had a complete 
loss of voice yesterday. The voice was breathy through- 
out the session. 


Session 5: Voice was moderately strained, “spastic.” 
Patient indicated she suspects contributing factors 
were stress at work and being on the phone all day. 


Session 6: Aphonic again. Patient does not know why as 
work situation appeared to be better. 


Session 7: Patient seen for follow-up. Voice had returned 
to normal. Voice began to improve shortly after she 
decided to leave her job at the hospital. Since that 
decision, not only voice has been normal but fatigue 
has lifted. 


Postscript 


This case of muscle tension dysphonia (MTD) took longer 
to remedy than is often the case because the etiologic 
basis of the tension, her job, was present throughout the 
period she was in voice therapy. This is not always the case. 
Many MTD patients come for treatment because the voice 
problem has been maintained long after the original cause 
has been eliminated. It is also not uncommon for these 
patients to exhibit a relapse as occurred in session 6. How- 
ever, most of the time it is self-remedied in 1 or 2 days. 


Case Study 13.16 Psychogenic Hoarseness 


History 


Mrs. S., 41 years of age, was examined for a voice disorder 
that she believed began after what appeared to be “bron- 
chitis.” Her voice symptoms remained episodic for 7 
months from which time she reported constant dyspho- 
nia. She said that her voice improved with antibiotics. 


Otolaryngology Examination 


ENT examination failed to disclose any pathology, other 
than irritation of the vocal folds, which, apparently, did 
not account for the severity of her dysphonia. 


Speech-Language Pathology Examination 


The voice could be characterized as moderately hoarse, 
harsh, breathy, and diplophonic. She had clear elevation 





of the laryngeal-hyoid complex, musculoskeletal tension, 
and pain in response to palpation of this region. Brief 
kneading in the thyrohyoid region, although painful, pro- 
duced almost immediate voice improvement indicating a 
musculoskeletal tension-related voice disorder. 

Psychosocial history was highly positive and in all prob- 
ability etiologically behind her voice disorder. Practically 
immediately after having been asked about her personal 
life, the patient broke down and cried, citing the following 
major conflict that has been in existence during the past 3 
to 4 years. The patient had a twin sister of whom the 
patient’s husband strongly disapproved, because he 
believed that she was manipulative and dishonest in her 
relationships with the rest of the family. The patient, on 
the other hand, wanted to be closer to her twin sister, par- 
ticularly during the past 3 to 4 years because of the suicide 
of her nephew at age 18. The patient’s husband, whom 
she regarded as the patriarch of the family, would have 
nothing to do with the patient’s sister and, moreover, 
objected to any attempt on the part of the patient to 
develop a closer relationship with her sister. He would not 
discuss the matter and would become angry at any of the 
patient’s attempts to make social contact or to discuss her 
sister. 

Consequently, the patient was in conflict; although she 
intellectualized that she regarded her husband as the final 
word in family matters, this was one issue that she could 
not rationalize out of existence. She was angry at him but 
could not stand up to him. She was depressed. 

She talked about her obesity, about her always being 
tired, but she could not seem to grapple with any of these 
issues because she felt her husband prevented her from 
asserting herself in her attempts to make her life better. 
She disagreed with him, for example, about rules pertain- 
ing to their 16- and 18-year-old children. Her husband 
always had the last word on matters of discipline, money, 
and other family matters. The patient said that on more 
than one occasion, she had considered going for psychi- 
atric help because she knew that she was unhappy with 
the family situation, but her husband controlled the purse 
strings and prohibited her from seeing anyone for psycho- 
logic help, because he did not believe in it. “He’s always 
right and others are always wrong.” 

The patient was really terribly upset about the entire 
situation, cried, and expressed herself as being in a 
dilemma; “I’m confused and mixed up. | don’t know what 
to do.” 


Impression 


This patient exhibits a psychogenic dysphonia secondary 
to suppressed anger and depression in response to a 
chronic conflict between her need for reconciliation with 
her sister and her fear of her husband’s disapproval. There 
are other issues of this patient’s life that are probably 
unresolved as well because of her fundamental inability to 
negotiate on an even plane with her husband. 

It is very important to note that this patient’s voice 
became completely normal during periods of emotional 
disclosure of her feelings concerning the above matters. 


Recommendation 


Voice or speech therapy was not recommended as the 
solution to this patient’s problem. Whatever gains would 
be made on the basis of symptomatic therapy, the clini- 
cian thought they would be short-lived, owing to the per- 
sistence of the conflict that she was experiencing. The 
clinician and patient talked at considerable length about 
the need for the patient to consult a psychiatrist, psychol- 
ogist, or psychiatric social worker. Even if her husband was 
intransigent, at least she might be able to be more 
assertive to achieve what was best for her own future hap- 
piness. Future movement in this direction was hampered 
by the patient’s not knowing where to turn next. Obvi- 
ously, she could not confront her husband with what she 
had learned in the evaluation, because there was no way 
he would understand, she said. Her sole conduit to get- 
ting psychologic help was her personal physician. 

Her husband, according to the patient, trusted her local 
physician. She had great confidence in him and was “the 
only person my husband would listen to.” The patient was 
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strongly encouraged to contact him and say that the 
speech-language pathologist would provide any informa- 
tion necessary to persuade the physician that this 
patient’s voice problems appeared directly and seriously 
related to the family problems noted. 


Postscript 


This case history provides an illustration of a clinical 
dilemma often faced by voice clinicians; to treat or not to 
treat. On the one hand the patient responded positively to 
manual re-posturing of the larynx and also exhibited normal 
voice during periods of emotional disclosure of her feelings. 
This indicates normal voice is possible but at what cost? 
Voice therapy is probably ill advised if the voice gain is pre- 
dicted to be temporary and if voice therapy might delay a 
psychiatric consult that could get at the etiology of the voice 
disturbance. The decision to treat or not needs to be made 
onan individual basis weighing in the person’s motivation to 
achieve voice change, psychologic state and traits, insur- 
ance coverage, and availability of professional help. 
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Diphtheria, 27 
vocal fold paralysis caused by, 81 
Diplophonia, 81 
Direct activating system. See Corticobulbar tract(s) 
DNA microarray technology, 29 
Down syndrome 
airway anomalies in, 325 
infant cries in, 28 
vocal characteristics in, 28 
Dynamic range, of professional singer, 279-280 
Dysarthria(s), neurologic disorders and, 71 
Dyskinesia, 90-91, 91t 
buccolingual, 116 
orofacial, 90-91, 116 
Dysmetria, 89 
Dysphonia 
apraxic, 94, 94t. See also Apraxia 
ataxic, 89, 90t 
in children, and personality, 167 
conversion, 182, 232-233 
definition of, 2 
flaccid 
pathophysiology of, 79-81, 79f, 80f, 80t, 81f 
vagal lesions and, 73-84 
frontal lobe lesions causing, 95-96 
functional 
differential diagnosis of, 88 
psychiatric characteristics of patients with, 183t 
hyperkinetic (choreic), 61-62, 89-90, 90t 
hyperkinetic (dystonic), 90-91, 91t 
hypokinetic (parkinsonian), 87-89, 89t. See also Parkinsonism 
of hypothyroidism, 31 
incidence of, 7 
mixed flaccid-spastic (pseudobulbar), 86-87 
muscle tension, case study, 299-300 
musculoskeletal tension, 172-173 
of neurologic disease, 71 
psychogenic 
case study, 294-295 
differential diagnosis of, 82-84 
treatment of, 195 
psychosocial effects of, assessment of, 136-137, 136f, 137f 
as sign or symptom, 4 
spasmodic. See Spasmodic dysphonia 
spastic, 101-102. See also Spasmodic dysphonia 
spastic (pseudobulbar), 84-86 
laryngeal-phonatory characteristics of, 85, 85t 
transient, in adolescent males, 15 
trauma-related, 34 
types of, 2 
ventricular, 178 
Dysphonia plicae ventricularis, 177 
Dyspneumia 
ataxic, 351 
definition of, 350 
dystonic, 352 
flaccid, 350-351 
hyperkinetic, 351-352 
hypokinetic, 351 
spastic, 351 
Dysprosody, 96 
Dyssynergia, 89 
Dystonia, 90-91, 91t, 113, 263 
adductor spasmodic dysphonia of, 115-117 
definition of, 115 
focal, 115 
laryngeal, 115 
mouth, 90-91 
laryngeal, 115 
torsion, 116 
voice disorder caused by, 352 
Dystonia musculorum deformans, 90 
Dystonic dyspneumia, 352 


E 
Earth, age of, 302, 303f 
Ear training, 234 
Echolalia, 93 
Ectoderm, 312, 315, 320f, 321f 356 
EGG. See Electroglottography (EGG) 
Elastic cartilage, 357 
Elastic fibers, 357 
Elastic recoil 
of lungs, 332 
of ribs, 332 
of viscera, 332 
Electrical stimulation, of denervated muscle, for selective 
reinnervation, 244 
Electroglottography (EGG), 150-151, 151f, 152f 
contact index, 151 
contact quotient, 151 
in older adults, 18 
signal 
contact phase, 151, 151f 
in loft (falsetto) register, 151, 151f 
minimal contact phase, 151, 151f 
in modal register, 151, 151f 
in pulse (vocal fry) register, 151, 151f 
qualitative variations in, and associated vocal fold 
adjustments, 151, 152f 
skirt bulging in, 151, 152f 
skirt elevation in, 151, 152f 
skirt ramping in, 151, 152f 
Electrolaryngography. See Electroglottography (EGG) 
Electromyography (EMG) 
laryngeal, 161 
velopharyngeal, 48-49, 48f 
Embryology, 302 
of anterior commissure, 320-321 
of bronchi, 322, 324f 
of cleft lip, 51-53, 52f-53f 
of cleft palate, 51-53, 52f-53f, 359 
of epiglottis, 358, 358f 
of face, 51-53, 52f-53f 
of head and neck, 312-313, 315f 
knowledge gained from, 312 
of larynx, 319-321, 323f, 358-359, 358f 
of lips, 51-52 
of lungs, 322, 324f, 329 
of palate, 52-53, 52f-53f 
of parathyroid gland, 316-317, 321f, 322f 
of pharyngotympanic tube, 322f 
of respiratory system, 321-325, 323f, 324f 329 
of thymus gland, 317, 321f, 322f 
of thyroid gland, 322f 
of tongue, 318-319, 320f, 321f, 322f 
of trachea, 321, 323f 
of tympanic cavity, 322f 
of vocal folds, 320, 323 
Emotion(s) 
and effects of vocal abuse, 173 
fundamental frequency and, positive 
association of, 168 
primary process, 168 
and vocalizations, 1-2 
voice as indicator of, 168-169 
Emotional lability (incontinence), 86 
Empyema, vocal fold paralysis caused by, 81 
Encephalitis, akinetic mutism caused by, 95 
Endocrine disorders, voice changes in, 31-34 
Endoderm, 312, 315, 316, 320f, 321f, 356 
Entropy measures, 147 
Epiglottis, 359f, 360, 360f 
development of, 11 
embryology of, 358, 358f 
in infant, 11 
Epiglottitis, 27 
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Epithelium, 355-356 
classification of, 356 
of epiglottis, 356 
functions of, 356 
of glottis, 356 
simple, 356 
simple squamous, 356 
stratified, 356 
of trachea, 356 
of vocal folds, 356, 365, 365f 
Equal appearing interval (EAI), 138, 138f 
Error Pattern Diagnostic Articulation Test, 66, 67f 
Esophageal reflux. See also Gastroesophageal reflux disease 
(GERD); Laryngeal esophageal reflux 
treatment of, 247 
and voice, 35 
Essential tremor. See also Organic (essential) voice tremor 
effects on voice, 92t 
voice disorder caused by, 91, 114-115, 114f 221-223, 222f 223f 
Essential tremor syndrome, 114 
Etiology 
and classification of voice disorder, 5 
definition of, 5 
Eunuch(s), 185 
Eunuchoid voice, 185 
Eupnea, definition of, 349 
Eustachian tube, 316 
patulous, 63 
Evidence-based medicine, 233-234 
Evolution, 2, 302 
laryngeal, 302-303, 304f 
of respiration, 327-328 
Exhalation, quiet, 332 
Expiration, 327 
Expiratory reserve volume, 147 
External laryngeal nerve, 367 
lesions, voice disorders caused by, 74 
Extracellular matrix (ECM), of lamina propria, 29 


F 
Fo. See Fundamental frequency 
Face 
development of, 317f-318f, 358-359, 358f 
embryology of, 51-53, 52f-53f 
Fach, 274, 278 
Facial nerve. See also Cranial nerve(s), VII 
laryngeal innervation by, 366-367 
Facilitating approaches, 4 
Factory workers, laryngeal pathology in, 8 
Falsetto, persistent, 185 
Family conflict, and dysphonia in children, 174, 249 
Fast Fourier transform (FFT), 144 
Fatigue 
definition of, 245 
illness-related, voice disorders caused by, 350 
vocal, 245-247 
and voice function, 135 
Feedback. See also Biofeedback, in voice therapy 
auditory, 234 
devices, in voice therapy, 241-242 
in voice therapy, 231 
Female(s) 
adolescent 
voice change in, 184 
age of onset and duration of, 16 
voice characteristics in, 15 
adult 
fundamental frequency in, as function of age, 17, 17t 
laryngeal dimensions in, 11, 12t 
phonational frequency range in, 19, 20f 
elderly, voice changes in, 17-18, 19 
fundamental frequency in, as function of age, 19 
laryngeal disease in, 7 


laryngeal growth, at puberty, 16 
postmenopausal, voice changes in, 15 
psychogenic voice disorders in, 8 
Feminization 
pathologic, voice changes in, 33 
of speaking voice, 264-265 
of speech content, 265 
Fetal crying, 12 
Fibrocartilage, 355, 357 
Flaccid dyspneumia, 350-351 
Flexibility 
abnormal, 3 
of normal voice, definition of, 3 
Flow 
mean (DC), 148 
measurement of, 148 
Flow glottogram, 149, 149f 
Flow-volume curve, 337, 338f 
Foramen cecum, 317, 320f 321f, 322f 323f 
Forced expiratory volume in 1 second (FEV;), 339 
Forced vital capacity (FVC), 339 
Foreign accent syndrome, 73, 96-97, 96t 
case study, 96 
Formant(s), 140 
singer's, 272t, 274 
Formant frequencies, 371 
Forte, 272t, 277 
FOXP2 gene, 310 
Frequency. See also Fundamental frequency 
change, in adolescent voice, 16 
Frequency range 
in adult males versus females, 19, 20f 
in childhood, 15 
of musical voice, 15, 16f 
of physiologic voice, 15, 16f 
Fricative(s) 
glottal, 57f, 58 
pharyngeal, 57f 58 
posterior nasal, 57f, 58 
velar, 57f, 58 
Frog 
larynx of, 303, 305f 
pulmonary function in, 327 
Frontal lobe lesions, 206, 206f 
bullet-wound, akinetic mutism caused by, 95 
dysphonia, aphonia, and mutism caused by, 
95-96 
mutism caused by, 95 
Frontal lobe syndrome, 73 
Frontonasal prominence, 51, 52, 317f 
Functional residual capacity (FRC), 339, 339f 
Functional [term], 166, 171 
Functional voice disorder(s), 6, 166, 182-184. See also 
Psychogenic voice disorder(s) 
definition of, 171 
and personality, 169 
psychiatric characteristics of patients with, 183t 
Fundamental frequency, 347-348 
abnormal, causes of, 143 
acoustic analysis of, correlation with subjective 
judgments of emotion, 168 
in adolescence, 15 
in adulthood, 16-17, 17t, 143, 143f 
in childhood, 14-15 
as function of age, 15, 17, 17t, 19, 19f 143, 143f 
of infant cries, 13 
measurement of, 142-143, 143f 
in older adults, 19, 19f 
sex differences in, 15, 143, 143f 
stress and, 170 
subglottic air pressure and, 14, 348, 348f 349f 
Fundamental period (T), of inverse filtered waveform, 
149, 149f 
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G 
Gag reflex 
absence of, 61 
hypoactive, 61 
reduction of, 61 
Gastroesophageal reflux disease (GERD), 274-275. 
See also Esophageal reflux; Laryngeal 
esophageal reflux 
in singers, 274 
and voice, 35 
Gender identity disorder, 33 
Gender reassignment. See Transsexualism 
Gene expression, in vocal lesions, 29 
Genetics 
of congenital anomalies, 324-325 
of cri du chat syndrome, 24, 25f 
of respiratory malformations, 324-325 
of velocardiofacial syndrome, 324 
Geniohyoid muscle, 361, 362f 
innervation of, 75, 367 
Gibbons, vocalizations of, 369 
Gigantism, 32 
Gill clefts, 358. See also Branchial grooves/clefts 
Gilles de la Tourette’s syndrome 
dysarthria of, 93-94, 94t 
haloperidol therapy for, 94, 263 
voice disorder of caused by, treatment of, 263 
Gills, 313, 327 
Globus sensation, 172, 206 
and laryngeal esophageal reflux, 172 
and psychogenic disorders, 172 
Glossopharyngeal nerve. See Cranial nerve(s), IX 
Glottal area, 355 
and glottal resistance, 347, 347f 
Glottal closure pattern, videostroboscopy of, 155 
Glottal fricative, 57f, 58 
Glottal resistance 
airflow rate and, 347, 348f 
glottal area and, 347, 347f 
Glottal spectrum, 140, 140f 
Glottal stop, 57f, 58 
Glottic closure, incomplete, 205, 205f 
Glottic web(s), 48 
Glottis, 355, 362, 364 
Goldenhar syndrome, 54 
Granuloma(s) 
laryngeal, 34f 35 
vocal fold paralysis caused by, 81 
GRBAS protocol, 138 
Greater palatine artery, 45 
Greater palatine foramen, 44, 44f 
Grimacing 
mute, 86 
nasal and/or facial, definition of, 42 
Grunting, 101 


H 
Haemophilus influenzae, 27 
Haloperidol, for Tourette’s syndrome, 94, 263 
Hard palate, 42, 43, 43f, 44f. See also Cleft palate; Palate 
anatomy of, 44, 44f 
blood supply to, 45 
cleft, 51t, 55f 
configuration, assessment of, 66 
Harmonics-to-noise ratio (HNR), 146 
Head 
injury, and foreign accent syndrome, 96 
myotomes, embryology of, 315f 
Hearing loss, conductive, cleft palate and, 59 
High pitch. See also Voice cluster(s), high-piched group 
psychogenic, case study, 297-299 
History-taking, 134-137 
HNR. See Harmonics-to-noise ratio (HNR) 


Hoarseness, 3. See also Adductor spasmodic dysphonia 
causes of, differential diagnosis of, 247t 
of hypothyroidism, 31 
with laryngeal cancer, 30 
psychogenic, case studies, 285-286, 292-293, 300-301 
as sign of illness, 4 
wet, 86 
Homemakers, voice disorders in, 8 
Hormone(s) 
effects on voice, 274 
female, and voice changes, 33-34 
sex, and voice changes, 33-34 
Howler monkeys, vocalizations of, 369 
Human papillomavirus (HPV), and papillomas, 26 
Humming, health benefits of, 237 
Huntington's disease, dysphonia in, 89-90, 90t 
Hyaline cartilage, 357 
Hydration 
poor, 233 
for professional singer, 276 
for vocal fatigue, 246 
Hydrocephalus, mutism in, 95 
Hyoid arch, 313, 313f, 314f, 315, 320f, 358-359, 358f 
derivatives of, 319t 
Hyoid bone, 309-310, 357f 359f 361, 362f 
anatomy of, 361 
body of, 359f, 361 
functions of, 10 
major horns of, 359f, 361 
minor horns of, 359f, 361 
movements of, 367-368, 368f 
ossification of, 11 
position of, 355, 361, 3563f 
sling suspension of, 367, 368f 
Hypercapnia, definition of, 350 
Hyperfunction, vocal, 5 
Hyperkinetic (choreic-dystonic) dysarthria, 61-62, 89-90, 90t 
Hyperkinetic dyspneumias, 351-352 
Hypernasality, 39 
acoustic studies of, 50 
case study, 40-41 
with cleft palate, 56, 59 
definition of, 41-42 
etiology of, 42, 42t 
flaccid, neurologic causes of, 61 
functional, 62 
hyperkinetic (choreic-dystonic), neurologic causes of, 62 
mixed flaccid-spastic, neurologic causes of, 62 
in neurologic disease, 61-62 
nonorganic, 62 
organic, 50-62 
semiobjective tests for, 64-65 
spastic (pseudobulbar), neurologic causes of, 61-62 
Hyperpituitarism, voice changes in, 32 
Hyperpnea, definition of, 349 
Hyperrhinolalia, 41 
Hyperthyroidism, voice changes in, 32 
Hypnosis, for adductor spasmodic dysphonia, 121 
Hypobranchial eminence, 315, 320f 323f 
Hypocapnia, definition of, 350 
Hypofunction, vocal, 5 
Hypoglossal nerve, laryngeal innervation by, 366-367 
Hypogonadotropic hypogonadism, isolated, voice changes in, 33 
Hypokinetic dysarthria, 61 
Hypokinetic (parkinsonian) dysphonia, 87-89, 89t. See also 
Parkinsonism 
Hypokinetic dyspneumia, 351 
Hyponasality, 39 
definition of, 42 
etiology of, 42, 42t 
organic, 62-63 
patulous eustachian tube and, 63 
semiobjective tests for, 64-65 
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Hypopnea, definition of, 350 
Hypothyroidism 
definition of, 31 
voice changes in, 31 
case studies, 31-32 
treatment of, 263 
Hypoxemia, definition of, 350 
Hypoxia, definition of, 350 
Hysteria, 200 
and conversion, differentiation of, 178 
criteria for, 178-179 
definition of, 178 


I 
Iatrogenic [term], definition of, 188 
Iatrogenic respiratory disorders, 352 
Iatrogenic voice disorders, 188, 352 
Iatrogenic voice rest, 188-190 
case studies, 189-190 
Idiopathic [term], definition of, 79 
Idiopathic vocal fold paralysis, 79 
Illicit drug use, history-taking about, 135 
Incisive fossa, 44, 44f 
Incomplete mutation, 185-186 
Indole-3-carbinol, 26 
Infancy 
laryngeal trauma in, 28 
neurologic voice disorders in, 73 
voice development in, 10-14 
Infant cries, 12-14, 24. See also Cri du chat syndrome 
birth signal, 12 
in Down syndrome, 28 
duration of, 13 
evolution of, 12-13 
facial expression associated with, 14 
fundamental frequency of, 13 
hunger signal, 12 
listener identification of, 13-14 
malnutrition and, 27-28 
neurologic disease and, 73 
pain signal, 12 
physiologic bases of, 14 
pleasure signal, 12 
signal function of, 12-13 
types of, 12 
Infection(s) 
differential diagnosis of, 247t 
epidemiology of, 7 
upper respiratory, and voice disorders, 107 
vocal fold paralysis caused by, 81 
Inferior laryngeal artery, 366, 366f 
Inferior laryngeal vein, 366 
Inferior pharyngeal constrictor muscle, innervation of, 367 
Inflammation, 6t 
and aphonia/dysphonia, in childhood, 27 
epidemiology of, 7 
Infraglottal dimensions, sex differences in, 15 
Infrahyoid muscles, 361 
innervation of, 75 
Inhalation, quiet, 331-332 
Insight, 199 
Inspiration, 327 
Inspiratory reserve volume, 147 
Intensity, vocal, 3, 144 
control of, 349 
elevated, causes of, 144 
reduced, causes of, 144 
subglottic air pressure and, 144, 349, 349f 
Intensive voice treatment (LSVT), in parkinsonism, 89 
Intention tremor, 89 
Intercostals externi, functional anatomy of, 332, 333t 
Intercostals interni, functional anatomy of, 332, 333t 
Intermaxillary suture, 44, 44f 


Internal laryngeal nerve, 366f 367 

lesions, voice disorders caused by, 74 
Interpalatine suture, 44, 44f 
Interviewer 

successful, characteristics of, 197-199 

voice of, 193 
Intonation 

in infancy, 14 

stages of, in development of language, 14 
Intraoral pressure (Pio), 49, 56, 150, 150f 

in consonant production, 49, 49f 

reduced, with cleft palate, 56, 59 
Intrapleural pressure, during respiratory cycle, 337, 338f 
Intubation 

laryngeal trauma in, 34 

prolonged, voice disorder caused by, 35 
Inverse filtering, 148-149, 149f 

waveform, time-based measures, 149, 149f 


J 

Jaw(s), upper, embryology of, 51-52 
Jitter, definition of, 146 

Jugular foramen syndrome, 81 


L 
Language development 
delayed, with cleft palate, 59 
evaluation of, 66-68 
intonational stages of, 14 
Laryngeal air sacs, in apes, 310 
Laryngeal airway resistance (Rjaw), 150 
Laryngeal carcinoma, 205, 205f 
Laryngeal chemoreflex (LCR), 322 
Laryngeal esophageal reflux, 233. See also Esophageal reflux; 
Gastroesophageal reflux disease (GERD) 
and contact ulcers, 177 
signs and symptoms of, 173 
Laryngeal framework surgery, 243 
for adductor spasmodic dysphonia, 121 
Laryngeal inlet valve, 310 
Laryngeal intubation, and contact ulcers, 177 
Laryngeal ligaments, 361 
Laryngeal membranes, 361 
Laryngeal muscles, 327 
extrinsic, 361, 362f 
innervation of, 74-75, 366 
intrinsic, 361-362 
innervation of, 366 
vagal lesions and, 77 
Laryngeal outlet valve, 310 
Laryngeal reinnervation surgery, 243-244 
Laryngeal scarring, iatrogenic, 243 
Laryngeal stridor 
congenital, 25 
inhalatory, in cri du chat syndrome, 25 
Laryngeal valving, 346-347, 347f, 348f 
Laryngeal ventricle(s), 320, 355, 364 
Laryngeal web(s), 205, 205f 
causes of, 26 
congenital, 26, 26f 
Laryngitis 
acute, 27 
chronic 
hypertrophic, 28-29 
nonspecific, 28 
in pop/cabaret singers, 176 
simple, 28-29 
Laryngitis sicca, 28 
Laryngomalacia, 25 
Laryngopharyngeal reflux (LPR). See also Esophageal reflux; 
Gastroesophageal reflux disease (GERD); Laryngeal 
esophageal reflux 
course of, 135 
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definition of, 247 
differentiation from other causes of hoarseness, 247, 247t 
in singers, 274-275 
symptoms of, 274-275 
treatment of, 247, 247f 
Laryngoscopy 
direct, 154 
flexible endoscopy for, 153-154 
indirect, 152-154, 153f 
in professional singer, 280-281 
mirror, 152-153, 153f 
parameters of interest in, 154 
rigid endoscopy for, 153, 153f, 154f 
Laryngospasm 
adductor, 102. See also Adductor spasmodic dysphonia 
differential diagnosis of, 102-103 
Laryngotracheal cleft, 26 
Laryngotracheal groove, 320f, 323f, 329, 358 
Laryngotracheobronchitis, 27 
Laryngo-tracheo-bronchomalacia, 25 
Larynx 
adult, anatomy of, 355, 3563f 
age-related changes in, 17-18 
anatomic orientation of, 359-360 
anatomy of, 1, 308-309, 355, 359-364, 3563f 
autonomic innervation of, 75 
biomechanics of, 367, 368f 
cartilages of, 360-361, 360f 
movements of, 368-369, 368f 
descent of, 10, 15, 308-309 
across lifespan, 10, 11f 
in non-human animals, 10-11, 308-309 
at puberty, 16 
development of, 357-359 
histologic, 10, 11-12 
morphologic, 10, 11 
topographic, 10 
dimensions of 
in adults, 11, 12t 
in infancy, 11, 12t 
at puberty, 11, 12¢ 15 
edema of, vocal abuse and, 173 
effort closure functions of, 369-370, 369f 
elevation, in musculoskeletal tension, evaluation of, 244, 245f 
embryology of, 319-321, 323f 358-359, 358f 
evolution of, 302-303, 304f 
functions of, 1-2, 310 
inflammation of, vocal abuse and, 173 
infraglottic, 362, 364f 
innervation of, 366-367, 367f 
interior of, 362-364 
levels (tiers) of, 362, 364f 
in laryngomalacia, 25 
lymphatics of, 366 
malignant tumors, 30, 31f 
mammalian, evolution of, 327-328 
maneuvering, to lower position in neck, 244-245, 245f 
microstructure of, development of, 322-323 
movements of, 367-368, 368f 
neonatal, 10 
neurophysiology of, 368-369 
opening of, arytenoid cartilages and, 327-328, 327f 
phonatory functions of, 370-374 
physiology of, 369-375 
position of, 355, 3563f 
in mutational falsetto, 186, 186f 
normal, 186, 186f 
postnatal, 10-12 
protective functions of, 327-328 
in respiration, 334-335, 335f 
respiratory functions of, 369 
sling suspension of, 367, 368f 
species differences in, 303-311 


swallowing functions of, 370, 370f 
trauma to, 135 
in adulthood, 34-35, 34f 
in neonate/infant, 28 
vascular supply of, 366, 366f 
visualization of, 151-160 
image analysis in, 157-160 
quantitative, 157-160, 160f 161f 
visual perceptual rating of, 157, 158f-159f 
Lateral cricoarytenoid muscle, 359f, 361-362, 363f 
Lateral palatine processes, 53 
Lateral pterygoid plate, 44, 45f 
Latissimus dorsi muscle, functional anatomy of, 333, 334t 
Laugh, in infancy, 14 
Laughing, pseudobulbar, 85-86 
Learning, in voice therapy, 232 
Lee Silverman Voice Therapy (LSVT), 242 
Lessac technique, 264 
Lesser gymnure, larynx of, 305, 306f 
Lesser palatine foramen, 44, 44f 
Leukoplakia, 28 
Levator veli palatini 
activity, in speech, 49 
anatomy of, 45, 45f 
vagal lesions and, 77 
Ligaments, 357 
laryngeal, 361 
Linear predictive coding (LPC), 144 
Linklater technique, 264 
Lip(s) 
embryology of, 51-52 
voluntary movement, assessment of, 66 
Lip trills, 237 
Loft register, 375 
Long-term average spectrum (LTAS), 144, 145f 
Loudness 
abnormal, 3 
probes for, 238 
definition of, 2 
of normal voice, definition of, 2-3 
vocal 
control of, 349 
determinants of, 374 
excessive, case study, 284-285 
subglottic air pressure and, 349, 349f 
Lower motor neuron (flaccid) dysarthria, 61 
LPR. See Laryngopharyngeal reflux (LPR) 
LSVT. See Intensive voice treatment (LSVT); Lee Silverman Voice 
Therapy (LSVT) 
LTAS. See Long-term average spectrum (LTAS) 
Lung(s) 
anatomy of, 328f 329, 330f 
characteristics of, 328-329, 329t 
compliance of, 338 
decreased, 350 
development of 
alveolar period of, 322, 329 
canalicular period of, 322 
pseudoglandular stage of, 322, 329 
terminal period of, 322, 329 
embryology of, 322, 324f 329 
evolution of, 326-327, 328-329, 329t 
gas exchange units of, 326 
recoil tendency of, 338 
Lung bud(s), 322, 324f 329 
Lung capacities, 339, 377f 
Lung volumes, 339, 377f 
during loud utterance, 349f 
during quiet breathing, 349f 
during speech, 344, 345f 346f 349f 
Lyapunov exponents, 147 
Lye, laryngoesophageal damage by, 35 
Lysol, laryngoesophageal damage by, 35 
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M 
Maculae flavae, 323 
Magnetic resonance imaging (MRI), dynamic, 48 
Magnetometry, 148 
Male(s) 
adolescent 
voice change in, 184 
age of onset and duration of, 16 
voice characteristics in, 15 
adult 
fundamental frequency in, as function of age, 17, 17t 
laryngeal dimensions in, 11, 12t 
phonational frequency range in, 19, 20f 
contact ulcers in, 177 
elderly, voice changes in, 15, 17-18, 19 
fundamental frequency in, as function of age, 19, 19f 
laryngeal disease in, 7 
laryngeal growth, at puberty, 16 
thyroid cartilage in, 11 
Malignancy, 6t, 263-264 
incidence of, 7 
laryngeal, 30, 31f 
vocal fold, differential diagnosis of, 247t 
Malnutrition, and infant cries, 27-28 
Mandible, voluntary movement, assessment of, 66 
Mandibular arch, 313, 313f, 314f, 320f 358-359, 358f 
derivatives of, 319t 
Mandibular nerve, 315 
Mandibular prominences, 51, 313, 314f, 317f, 320f 
Mania, voice as indicator of, 169 
Mass lesion(s), laryngeal, 205, 205f 
voice disorders caused by, 135 
treatment of, 263-264 
Mastoid antrum, 316 
Maxillary nerve, 315 
Maxillary prominence, 51-52, 313, 314f 317f 
Maximum flow declination rate (MFDR), 149, 149f 
Maximum phonation time (MPT), 148 
Meckel’s cartilage, 315f, 320f 
Medial pterygoid plate, 44, 45f 
Median palatine process, 52 
Medications, used by professional voice users, history-taking 
about, 273-274 
Meige’s syndrome, 113, 115-116 
Menstrual cycle 
effects on voice, 274 
and voice, 33-34 
Mesectoderm, 315 
Mesencephalon, hemangioma of, akinetic mutism caused by, 95 
Mesoderm, 312, 315, 320f, 321f 
Messa di voce exercises, 272t, 277 
Metabolic disorders, 6t 
Mid-dorsum palatal stop, 57f 58 
Mirror-clouding test, 64-65 
Mixed dystonia-tremor, 113 
Mixed lower-upper motor neuron (flaccid-spastic) dysarthria, 61 
Mixed nasality, 42, 64 
etiology of, 42, 42t 
Modal register, 375 
Mother(s) 
of infants, identification of infant cries, 13-14 
of small children, voice disorders in, 8 
Motor neuron disease, 87. See also Lower motor neuron (flaccid) 
dysarthria; Upper motor neuron disease 
Mouth breathing, 63 
Movement disorders 
extrapyramidal, and adductor spasmodic dysphonia, 116 
neurologic, 113 
MTD. See Muscle tension dysphonia (MTD) 
Mucosal wave, 374 
definition of, 155 
videokymography of, 156, 156f 
videostroboscopy of, 155 





Multiple sclerosis (MS), and vocal fold paralysis, 81 
Muscle tension dysphonia (MTD), case study, 299-300 
Muscle tissue, 357 
Muscular dystrophy, flaccid dysphonia caused by, 78-79 
Muscular tension 
stress and, 244 
testing for, 118, 244-245, 245f 
Musculoskeletal tension 
in conversion voice disorders, 233 
reduction, in contact ulcer treatment, 250 
in vocal abuse disorders, 233 
Musculoskeletal tension disorders, 6t, 113, 118, 172-173, 232, 233 
autonomic function and, 171 
and contact ulcers, 177 
diagnosis of, 172 
dysphonia caused by, case study, 299-300 
larynx in, examination of, 172 
pathogenesis of, 172 
testing for, 118 
treatment of, 118, 244-245, 250 
voice improvement after, assessment of, 244-245 
vocal hyperfunction in, treatment of, 244-245 
Musculus uvulae, anatomy of, 46 
Musical terminology, 272, 272t 
Mutational falsetto, 184, 185, 352 
laryngeal position in, 186, 186f 
psychogenic, case study, 289-290 
treatment of, 257-260, 265 
case study, 258-260 
Mutation of voice 
incomplete, 185-186 
normal pubertal, 184 
pathologic, 184-188 
classification of, 184-185 
organic, 184 
psychogenic, 185 
Mute grimacing, 86 
Muteness. See also Mutism 
conversion, 179 
case study, 295-296 
definition of, 2 
Mutism. See also Akinetic mutism; Muteness 
frontal lobe lesions causing, 95-96 
psychogenic, case study, 295-296 
Myasthenia gravis, 205, 206f 
differential diagnosis of, 83-84, 210-211 
case studies, 83-84, 291-292 
laryngeal-phonatory findings in, 78, 82, 82t 
vocal fold bowing in, 35 
voice disorder caused by, 61, 72t, 76f, 78, 81-84 
case study, 82-84 
course of, 135 
treatment of, 263 
Mylohyoid muscle, 361 
innervation of, 75, 367 
Myoelastic-aerodynamic theory, of voice production, 357f 
371-374, 371f-373f 
Myopathy(ies), flaccid dysphonia caused by, 78-79 
Myxedema, definition of, 31 


N 
Nares occlusion test of exhalatory efficiency, 65 
Nasal air emission, 39 
case study, 40-41 
with cleft palate, 56, 59 
definition of, 42 
etiology of, 42, 42t 
neurologic causes of, 61-62 
in neurologic disease, 61 
nonorganic, 62 
organic, 50-62 
semiobjective tests for, 64-65 
Nasal airflow studies, 49-50, 49f 
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Nasal and/or facial grimacing, definition of, 42 
Nasal cavity, obstruction of, 62-63 
Nasal endoscopy, 47-48 
Nasal obstruction, 62-63 
Nasal pits, 51 
Nasal placodes, 51 
Nasal prominences, 51 
Nasal resonation, vagus nerve lesions and, 72t, 75f-76f 
Nasal resonatory disorders, 39, 41-42 
classification of, 41-42 
clinical examination for, 64-68 
definition of, 41-42 
etiology of, 42, 42t 
perceptual examination for, 64-65 
Nasal semivowels, 39 
Nasal speech, 62 
Nasal vibration test, 65 
Nasolacrimal groove, 51 
Nasopharynx, large (congenital), 59-60 
National Center for Voice and Speech, Web site of, 274 
Neck, myotomes, embryology of, 315f 
Neonate(s), laryngeal trauma in, 28 
Neoplasms, 263-264 
benign, 6t 
Neural crest cells, 315 
Neurasthenics, 350 
Neurochronaxic theory, of voice production, 364 
Neuroectoderm, 315 
Neurologic disease(s), 6t, 205, 205f 
affecting phonation, 350-351 
flaccid dysphonia in, 81 
and foreign accent syndrome, 96 
vocal fold paralysis in, 81 
Neurologic voice disorder(s), 61-62, 71-100 
in adults, 73-97 
arrhythmically fluctuating, 73, 89-91 
case study, 208-209 
in children, 73-97 
course of, 135 
differential diagnosis of, 108 
in infants, 73 
with loss of volitional phonation, 73, 94-97 
paroxysmal, 73, 93-94 
relatively constant, 73-89 
rhythmically fluctuating, 73, 91-93 
treatment of, 263 
types of, 71 
Neurosis, 199 
Nodules, vocal. See Vocal nodules 
Nonlinear dynamic analysis, of voice disorder, 
146-147 
Nonlinear dynamic systems, 147, 147f 
Nonlinear dynamic theory, 147 
Nonvibratory segment 
definition of, 155 
presence of, videostroboscopy of, 155 
Nucleus ambiguus, 73 
damage to, 61 
Nyquist plot, 159-160 





o 
Oblique arytenoid muscle, 359f, 362, 363f 
Obliquus externus abdominis muscle, functional anatomy 
of, 333-334, 334t 
Obliquus internus abdominis muscle, functional anatomy 
of, 333, 334t 
Obsessive patient, 199-200 
Occupation 
and vocal abuse, 174, 233 
and voice disorders, 7-8 
Older age 
and laryngeal pathology, 7 
voice changes in, 17-18 


Omohyoid muscle, 361, 362f 

innervation of, 75, 367 
Ontogeny, 313 

definition of, 302 
Open nasality, 41 
Open quotient (0.Q.), 374 

age-related changes in, 18 

of inverse filtered waveform, 149, 149f 
Oral contraceptives, and voice, 33-34 
Organic voice disorders. See Voice disorder(s), organic 
Organic (essential) voice tremor, 91-92, 92t, 114-115, 114f, 

351-352. See also Essential tremor 
abductor spasmodic dysphonia of, 91 
adductor spasmodic dysphonia of, 91, 114-115, 114f, 221-223, 
222f, 223f 
emotional stress as precipitant of, 115 

treatment of, 263 
Orofacial dyskinesia, 115 
Oropharyngeal membrane, 313-315, 314f, 320f 
Oro-pharyngo-laryngeal system, development of, 358-359 
Orthopnea, definition of, 350 
Otolaryngologist, referring, letter to, 218 


P 
Pain, in laryngeal musculoskeletal tension, 172 
testing for, 244, 245f 
Palate. See also Cleft palate; Hard palate; Soft palate 
congenitally short, 59-60 
embryology of, 52-53, 52f-53f 
injury to, 60 
primary, 52 
secondary, 53 
trauma to, 60 
Palatine bone, horizontal plate of, 44, 44f 
Palatoglossus muscle 
activity, in speech, 49 
anatomy of, 46 
Palatomaxillary suture, 44, 44f 
Palatopharyngeal insufficiency. See also Cleft lip; Cleft palate 
compensatory articulatory and phonatory substitutions in, 
56-59, 57f 
reduced intraoral pressure with, 56 
Palatopharyngeus muscle 
activity, in speech, 49 
anatomy of, 46 
Palatopharyngolaryngeal myoclonus, 62, 92-93, 93t 
case study, 92 
epidemiology of, 93 
etiology of, 93 
Papilloma(s), laryngeal, in children, 26-27, 27f 
Paradoxical vocal fold dysfunction (PVFD), treatment of, 262-263 
Parathyroid gland 
embryology of, 316-317, 321f, 322f 
II (inferior), 316 
inferior, 321f 
IV (superior), 317 
superior, 321f 
Parietal pleura, 322, 324f 
Parkinsonism, 206, 206f 
case study, 208-209 
dysphonia of, 87-89, 89t 
case study, 88 
vocal fold bowing in, 35 
voice disorder caused by, treatment of, 263 
Passagio, 272t, 273 
Passavant’s ridge, 46 
Pectoralis major muscle, functional anatomy of, 333, 334t 
Pectoralis minor muscle, functional anatomy of, 333, 334t 
Pediatric Voice Outcome Survey (PVOS), 137 
Perceptual attributes, and classification of voice disorder, 5 
Perceptual studies. See also Auditory perceptual analysis 
of speech, 50 
of voice sound changes, 18-20 
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Pericardio-peritoneal canals, 322, 324f 
Periodicity, of vibratory cycle 
definition of, 155 
videostroboscopy of, 155 
Persistent falsetto, 185 
Personality 
and contact ulcers, 177 
and conversion voice disorders, 180 
definition of, 167 
of dysphonic children, 167 
and effects of vocal abuse, 173 
growth, in patients with psychogenic voice disorders, 199 
normal, voice as indicator of, 167-168 
and vocal nodules 
in adults, 175-176, 249 
in children, 175, 249 
and voice, studies of, 167 
and voice/speech patterns, 170 
Perturbation 
definition of, 146 
measurement of, 146, 146f 
Perturbation plot, 160 
Perverse mutation, 32-33 
Pharyngeal arches, 358-359, 358f 
Pharyngeal fricative, 57f, 58 
Pharyngeal nerve, 367 
lesions, voice disorders caused by, 72t, 73 
Pharyngeal paralysis, 77 
Pharyngeal plexus, 61 
Pharyngeal pouch(es), 312, 313f, 316-318, 320f 
derivatives of, 316-318, 321f 322f 
fifth, derivatives of, 318 
first, 314f 
derivatives of, 315, 316, 321f, 322f 
fourth, derivatives of, 317, 322f 
second, derivatives of, 316, 322f 
third, derivatives of, 316-317, 321f, 322f 
Pharyngeal stop, 57f, 58 
Pharyngeal wall(s) 
lateral, 43, 43f 
posterior, 43, 43f 
Pharyngotympanic tube, embryology of, 322f 
Phase closure 
definition of, 155 
videostroboscopy of, 155 
Phase space, 147, 147f 
Phase symmetry 
definition of, 155 
videostroboscopy of, 155 
Philtrum, 51 
Phonation, 232 
abdominal-intercostal breathing coordinated with, 234 
anatomy of, 355-376 
definition of, 1, 2 
disordered, 101 
neurophysiology of, 71 
physiology of, 355-376 
respiratory disorders and, 349-353 
vagus nerve lesions and, 72t, 75f-76f 
Phonation quotient, vocal cord paralysis and, 80.78t 
Phonation threshold pressure (Pan), 150, 375 
of professional singer, 280 
Phonation volume (PV), 148 
Phonemes, velopharyngeal mechanism and, 39 
Phonetogram, of professional singer, 279-280 
Phonosurgery, 243-244 
for focal fold paralysis, 261 
for professional singers, indications for, 281-282 
Phylogeny, 313 
definition of, 302 
Physician-patient relationship, 134-135, 204 
Piano, 272t, 277 
Pick’s disease, mutism in, 95 


Pio. See Intraoral pressure (Pio) 
Pitch, vocal. See also Fundamental frequency; 
High pitch 
abnormal, 3 
appropriate 
in contact ulcer treatment, 250 
in voice therapy, 250 
changes at puberty, 16 
definition of, 2 
determinants of, 374 
modification, in management of adductor spasmodic 
dysphonia, 120 
normal, definition of, 2 
optimum, 236 
Pleasure, voice as indicator of, 12 
Plethysmography, inductance, 148 
Pleural cavity(ies), 324f 
primitive, 322 
Pneumotachography, 148 
Pneumotaxic center, 340f, 341-342, 342f 
Podule(s), 29 
Poliomyelitis, and vocal fold paralysis, 81 
Pollutants, 233 
Polyp(s), 29, 30f, 205, 205f 
gene expression profile for, 29 
incidence of, 7 
vocal abuse and, 173 
Polypoid degeneration, 29 
Pomum Adami, 355 
Posterior cricoarytenoid muscle, 
359f, 362, 363f 
Posterior glottal chink, 374 
Posterior nasal fricative, 57f 58 
Posterior nasal spine, 44, 44f 
Posterior pillar of the fauces, 43-44, 44f 46 
Postnasal drip, differential diagnosis of, 247t 
Precipitating factors, 233 
Premenstrual syndrome, and voice, 34 
Prepalate, cleft, 51t 
Presbylarynges, 6t 
Pressure. See also Phonation threshold pressure (Pn) 
alveolar, during respiratory cycle, 
337, 338f 
changes during respiratory cycle, 
337, 338f 
definition of, 150 
intraoral (Pio), 56, 150, 150f 
intrapleural, during respiratory cycle, 337, 338f 
subglottal (P,), 150. See also Subglottic air pressure 
Primates 
laryngeal descent in, 10 
larynges of, 2 
vocalizations of, 306-308, 369 
Procollis, 115 
Professional voice users. See also Actors; Singers 
definition of, 271 
oral contraceptive use by, 34 
types of, 271 
voice disorders in, 8 
Prognostic probes, techniques used as, 238 
Progressive muscular atrophy (PMA), 87 
Progressive relaxation, in voice therapy, 236, 245 
Progressive supranuclear palsy, 86 
Prosody, affective, 168 
Protein(s), structural, 357 
Proteomics, 29 
P,. See Subglottal pressure (P,) 
Pseudobulbar dysphonia, 160 
Pseudobulbar palsy, 86 
voice disorder caused by, treatment of, 263 
Psychodiagnostic interview 
case studies, 200-202 
rationale for, 194-197 


Index l-13 





Psychogenic voice disorder(s), 166-192, 171, 206, 206f. See also 
Abductor spasmodic dysphonia, psychogenic; Adductor 
spasmodic dysphonia, psychogenic; Aphonia, 
psychogenic; Dysphonia, psychogenic; High pitch, 
psychogenic; Hoarseness, psychogenic; Mutational 
falsetto, psychogenic; Mutism, psychogenic; 
Puberphonia; Velopharyngeal insufficiency, psychogenic 

case studies, 284-301 

clinical features of, 108 

course of, 135 

definition of, 6, 171 
epidemiology of, 7, 8 

etiology of, 6-7, 6t 

in females, 8 

implicating respiration, 352-353 
in neurologic disorders, 96 

in older adults, 18 

onset of, 135 

stages of personality growth in patients with, 199 
stress and, 170-172 

treatment of, 194-195, 195-197 
vocal fold bowing in, 35 

Psychoneuroimmunologic (PNI) framework, for stress-related 
voice responses, 171 

Psychopathology 

in patients with voice disorders, 199-203 
case studies, 200-202 
voice as indicator of, 6, 6t, 169-170 
Psychosis, 199 
Psychotherapist 
speech-language pathologist as, 194-195 
voice of, 169, 193 
Psychotherapy 
in contact ulcer treatment, 250 
therapist’s use of voice in, 169, 193 
Pterygoid hamulus, 44, 45f 
Pan. See Phonation threshold pressure (Pan) 
Puberphonia, 184, 352. See also Mutational falsetto 
case study, 289-290 
treatment of, 257-260 

Puberty. See also Adolescent voice change 
age of onset, 16 
in male versus female larynx, 184 
onset, factors affecting, 16 
voice development in, 15-16, 184 

Pulmonary function testing, 147-148 

Pulse register, 375 

Pyramidal corticobulbar system, lesions, dysphonia caused by, 
84-86 


Quadrangular membrane, 361 
Quality, of voice 

definition of, 2 

normal, definition of, 2 


R 
Radiation injury, to larynx, 35 
Ramp signal, 340 
Reactive anxiety, 242 
Rectus abdominis muscle, functional anatomy of, 333, 334t 
Recurrent laryngeal nerve, 316, 319¢, 367 
anatomy of, 74, 74f, 367 
lesions 
effects on phonation and resonation, 72t, 76f 
infants with, 73 
voice disorders caused by, 74-75 
resection, for adductor spasmodic dysphonia, 119-120 
Referral(s), to psychiatrist or psychologist, 195 
Regularity, of vibratory cycle 
definition of, 155 
videostroboscopy of, 155 
Reichert’s cartilage, 315f, 320f 


Reinke’s edema, 29 
gene expression profile for, 29 
Reinke’s space, 365 
Relaxation, in voice therapy, 236 
Residual volume, 147, 339, 339f 
Resonance balance, methods for achieving, 
236-237 
Resonance imbalance, 42 
Resonant voice, 236-237 
Resonant voice therapy, for vocal fatigue, 246-247 
Resonation, definition of, 1 
Respiration 
anatomy of, 326 
chemical control of, 341, 341f 
definition of, 1 
evolution of, 327-328 
external 
anatomy of, 328-334 
definition of, 328 
physiology of, 334-339 
forced, 331 
internal, definition of, 328 
laryngeal action during, 369 
neurophysiology of, 339-342, 342f 
physiology of, 326 
quiet, 331 
muscles of, 331-333, 332f 
for speech, 326, 344-354 
vital, versus speech respiration, 326, 344, 345f 
Respiratory bronchioles, 322 
Respiratory center, 340-341, 340f 
Respiratory control, in voice therapy, 234-236 
Respiratory cycle, 335-337, 336f, 337f 
flow changes during, 337, 338f 
pressure changes during, 337, 338f 
volume changes during, 337, 338f 
Respiratory disorders 
affecting phonation, 350 
and dysphonias, 326 
effects on phonation, 349-353 
Respiratory dysfunction, in adductor spasmodic 
dysphonia, 104 
Respiratory muscles, 331-334 
activity during speech, 344-346, 346f 
of forced exhalation, 333-334, 334t 
of forced inhalation, 333, 334t 
of forced respiration, 333-334 
of quiet inhalation, 331-332, 333t 
of quiet respiration, 331-333, 332f 333t 
Respiratory neurons 
dorsal, 340, 340f 
ventral, 340, 340f 
Respiratory rate, for speech, 344 
Respiratory system 
age-related changes in, 17-18 
air-volume measurements for, 147-148 
embryology of, 321-325, 323f, 324f 329 
human 
evolution of, 310 
phylogeny of, 328-329, 329t 
malformations, genetics of, 324-325 
Reticular fibers, 357 
Retired persons, voice disorders in, 8 
Retrocollis, 115 
Rhesus monkey, larynx of, 305-306, 308f 
Rheumatoid arthritis, 29-30 
Rhinolalia aperta, 41 
Rhinolalia clausa 
anterior, 42, 62 
posterior, 42, 62 
Rhinolalia mixta, 64 
Rhinosinusitis, differential diagnosis of, 247t 
Rib cage, anatomy of, 329-331, 330f, 331f 
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Right hemisphere, cortical lesions, and affective prosodic 
function, 168 

Ring, 272t, 274 

Robin sequence, 54 


S 
Salamander, larynx of, 303 
Scalenus anterior muscle, functional anatomy of, 332, 333t 
Scalenus medius muscle, functional anatomy of, 332, 333t 
Scalenus posterior muscle, functional anatomy of, 332, 333t 
Schizophrenia 
in adolescence, vocal characteristics with, 170 
in adulthood, vocal characteristics with, 170 
and affective prosodic function, 168 
in childhood, vocal characteristics with, 170 
voice as indicator of, 169-170 
Seizures, in children, mutism after, 95 
Serratus posterior superior muscle, functional anatomy 
of, 333, 334t 
Sex hormones, and voice, 33 
Shimmer, definition of, 146 
Sign, definition of, 4 
Signal(s) 
for acoustic analysis, 142 
type 1, 142 
type 2, 142 
type 3, 142 
Signal-to-noise ratio (SNR), 146 
Singers 
bass or bass-baritone, 279 
classical 
classification of, 278-279 
performance practices of, 276-277 
coloratura mezzo-soprano, 279 
coloratura soprano, 278 
dramatic, 278 
dramatic baritone (heldenbaritone), 279 
dramatic mezzo-soprano, 279 
dramatic soprano, 278 
geriatric, voices of, 18 
laryngeal inflammation in, corticosteroids for, 274 
lyric baritone, 279 
lyric mezzo-soprano, 278 
lyric soprano, 278 
nonclassical, performance practices of, 276 
operatic, classification of, 278-279 
pop, and vocal nodules, 176 
professional 
acoustic evaluation of, 279-280 
aerodynamic evaluation of, 279-280 
anterior neck, palpation of, 280 
business-related concerns of, 277 
clinical evaluation of, 277-281 
continuing education on, 282 
definition of, 271 
emotional concerns of, 275 
functional vocal requirements of, 271 
history-taking from, 272-277 
hydration for, 276 
indirect laryngoscopy in, 280-281 
laryngopharyngeal reflux in, 274-275 
medications used by, 273-274 
phonosurgery in, indications for, 281-282 
singing style of, 275-276 
smoking by, 276 
special considerations for, 271-283 
subtlety of symptoms in, 273 
treatment of, 281-282 
voice evaluation for, 271 
voice study/training of, 275 
voice use patterns not related to performance, 273 
soubrette soprano, 278 
spinto soprano, 278 


tenore dramatico (heldentenor), 279 
tenore leggiero, 279 
tenore lyrico (lyric tenor), 279 
vocal nodules in, 249 
voice disorders in, 264 
Singer’s formant, 272¢, 274 
Sjégren’s syndrome, 28 
Skirt bulging, in EGG signal, 151, 152f 
Skirt elevation, in EGG signal, 151, 152f 
Skirt ramping, in EGG signal, 151, 152f 
Smoking 
cessation 
in contact ulcer treatment, 250 
in voice therapy, 250 
and laryngeal cancer, 30 
and laryngeal mucosal hypertrophy, 28 
and laryngitis, 28 
by singers, 276 
Smooth muscle, 357 
SNR. See Signal-to-noise ratio (SNR) 
Soft palate, 42, 43, 43f, 44f. See also Cleft palate; Palate 
blood supply to, 45 
cleft, 51t, 55f 
configuration, assessment of, 66 
congenitally short, 59-60, 60f 
paralysis 
vagal lesions and, 61, 72t, 76-77 
voice changes caused by, 77 
Sound spectrography, 241 
Source-filter theory, 140-141, 140f 
Spasmodic dysphonia, 101-132 
adductor. See Adductor spasmodic dysphonia 
definition of, 101-102 
etiology of, 7, 101 
historical perspective on, 101 
incidence of, 101 
terminology for, 101-103 
Spasmodic torticollis, 113, 115 
Spastic dysphonia, 101-102 
Spastic dyspneumia, 351 
Spectral measures, 144-146, 145f 
Spectrography, 144-146, 145f 
Speech 
anatomical substrate of, 2 
childlike, in adult, 187 
case study, 187-188, 228-229, 229f 
contextual, assessment of, 64 
development, delayed, with cleft palate, 59 
intellectual basis of, 1-2 
intermittently whispered-phonated, 123 
muscle movement in, 49 
perceptual analysis of, 50 
production, source-filter theory of, 140-141, 140f 
respiration for, 344-354 
respiratory rhythm for, 235 
Speech anatomy, examination of, 65-66 
Speech-language pathologist 
continuing education for, on professional singers, 282 
interviewing and counseling skills needed by, 
196-197 
and psychological support for patient, 195-197 
Speed quotient (S.Q.), 373-374 
of inverse filtered waveform, 149, 149f 
Sphenoid bone, anatomy of, 44, 45f 
Sphenopalatine artery, 45 
Spirometry, wet, 148 
S.Q. See Speed quotient (S.Q.) 
Squamous cell carcinoma, laryngeal, 30, 31f 
Squeal, in infancy, 14 
Staccato, 272t, 277 
Sternocleidomastoid muscle, functional anatomy of, 333, 334t 
Sternohyoid muscle, 361, 362f 
innervation of, 75, 367 
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Sternothyroid muscle, 361, 362f 
innervation of, 75, 367 
Stockholm Voice Evaluation Approach (SVEA), 138-139 
Stomodeum, 313, 313f, 314f 317f, 358 
Stormy voice mutation, 16 
Strap muscles, 361 
Stress 
chronic, 206-207 
and contact ulcers, 177 
laryngo-responders to, 171 
and muscular tension, 244 
psychological/emotional 
and adductor spasmodic dysphonia, 106, 115 
and adductor spasmodic dysphonias, 109-113, 115 
and tremor, 115 
and vocal nodules, 175-176 
and voice disorders, 231-232 
and voice function, 135 
voice as indicator of, 12, 170-188 
case study, 172-173 
psychoneuroimmunologic (PNI) framework for, 171 
Stretch receptors, pulmonary, 341-342, 342f 
Striated muscle, 357 
Stridor 
of congenital subglottic stenosis, 25-26 
definition of, 25 
inhalatory. See also Croup 
differential diagnosis of, 78 
in infants with bilateral abductor paralysis, 73 
with laryngeal web, 26 
laryngeal, congenital, 25 
of laryngomalacia, 25 
Stroboscopy Evaluation Rating Form (SERF), 157, 158f-159f 
Stylohyoid muscle, 361, 362f 
innervation of, 74-75, 367 
Stylopharyngeus muscle, 361 
Subglottal pressure (P,), 150 
Subglottic air pressure 
and fundamental frequency, 348, 349f 
during speech, 344-345, 346, 346f 347f 
and vocal intensity, 349, 349f 
Subglottic region, 355 
Subglottic stenosis, congenital, 25-26 
Submucous cleft, 53, 55f 60, 60f 
congenital, 60, 60f 
Sulcus vocalis, 6t, 26 
Superior laryngeal artery, 366, 366f 
Superior laryngeal nerve, 316, 319¢, 367 
lesions 
effects on phonation and resonation, 
35, 72t, 76f 
voice disorders caused by, 74, 77-78, 78f 
Superior laryngeal vein, 366 
Supraglottic region, 355 
Suprahyoid muscles, 361 
innervation of, 74-75 
Supralaryngeal valving, 346-347, 347f 348f 
Supralaryngeal vocal tract (SVT), in humans, 308 
Surfactant, 329 
Surgery, for professional singers, indications for, 
281-282 
Sustainability, of normal voice, definition of, 3 
Swallowing 
biomechanics of, 46 
laryngeal action during, 370, 370f 
Swim-bladder 
characteristics of, 328-329, 329t 
evolution of, 328-329, 329t 
Sydenham’s chorea, dysphonia in, 
89-90, 90t 
Symptom, definition of, 4 
Syringobulbia, and vocal fold paralysis, 81 
s/z ratio, 148 


T 
Tachypnea, definition of, 349 
Tadpoles, gill function in, 327 
Tardive dyskinesia, 90 
Teachers 
and vocal abuse, 174 
voice disorders in, 7-8 
Teeth 
abnormalities of, with cleft palate, 59 
examination of, 66 
Teflon injection, 243 
pitfalls of, 262 
for vocal fold paralysis, 262 
Telemedicine, 242-243 
Temporal bone, anatomy of, 45 
Tendons, 357 
Tension-fatigue syndrome, 113 
Tensor tympani muscle, 45 
Tensor veli palatini 
activity, in speech, 49 
anatomy of, 45, 45f 
Terminal sacs, 322, 329 
Thermal injury, 35 
Third ventricle 
hemangioma of, akinetic mutism caused 
by, 95 
tumors of, akinetic mutism caused by, 95 
Thoracic cage, anatomy of, 329-331, 330f 
Thymus gland, embryology of, 317, 321f, 322f 
Thyroarytenoid muscle(s), 355, 359f 362, 363f 
atrophy, 35 
Thyrohyoid ligaments 
lateral, 361 
median, 361 
Thyrohyoid membrane, 361 
Thyrohyoid muscle, 361 
innervation of, 367 
Thyroid cartilage, 359f, 360, 360f 362f 
development of, 11 
sex differences in, 11 
Thyroid diverticulum, 321f 
Thyroidectomy, vocal fold paralysis caused by, 81 
Thyroid gland, embryology of, 322f 
Thyroid lamina, angle of, sex differences in, 15 
Thyroid notch, 352, 359f 
Thyroplasty, 243 
Tidal volume, 147, 339, 339f 
Tissue(s) 
connective, 356-357 
epithelial, 355-356 
laryngeal, 1 
muscle, 357 
types of, 355-357 
of vocal folds, 1 
Tobacco use, history-taking about, 135 
Tone focus, in voice therapy, 236 
Tongue 
anatomy of, 322f 
embryology of, 318-319, 320f, 321f, 322f 
median septum of, 319 
median sulcus of, 319, 322f 
Tongue trills, 237 
Tonsillar crypts, 316 
Tonsillar fossa, 316 
Torsion dystonia, 116 
Total lung capacity, 339, 339f 
Total lung volume, 147 
Tourette’s syndrome. See Gilles de la Tourette’s syndrome 
Trachea 
anatomy of, 328-329, 328f 
embryology of, 321, 323f 
Tracheal rings, 359f 
Transgender therapy. See Transsexualism 
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Transsexualism, 187 
voice treatment in, 194, 264-265 
Transverse arytenoid muscle, 359f, 362, 363f 
Transversus abdominis muscle, functional anatomy of, 
333, 334t 
Trauma, 6t 
cerebral, and foreign accent syndrome, 96 
laryngeal 
in adulthood, 34-35, 34f 
in infancy, 28 
palatal, 60 
velopharyngeal, 60 
vocal fold paralysis caused by, 81 
Treatment, 231-270 
clinician-patient agreement/disagreement about, 4 
feedback devices in, 241-242 
general voice therapy for, 233-242 
goals of, 5 
patients’ reasons for seeking, 5 
principles of, 231-233 
of professional singers, 281-282 
resonant voice therapy for, 236-237 
trial and error in, 237-238 
Tree shrew, larynx of, 305, 307f 
Tremor. See also Organic (essential) voice tremor; 
Palatopharyngolaryngeal myoclonus; Voice tremor 
benign heredofamilial, 91 
causes of, 91 
intention, 89 
normal, 91 
physiologic, 91 
Trial and error, in voice therapy, 237-238 
Trial therapy, 4 
Trigeminal nerve. See also Cranial nerve(s), V 
laryngeal innervation by, 366-367 
muscles innervated by, 61 
Trigger incident(s), 106 
Trisomy 11q, 324 
Tuberculum impar, 318-319, 320f 
Tubotympanic recess, 316, 321f 
Tympanic cavity, 316, 321f 
embryology of, 322f 
Type A personality, and voice/speech patterns, 170 
Type B personality, and voice/speech patterns, 170 


U 

Ultimobranchial body, 321f, 322f 

Unemployed persons, voice disorders in, 8 

Upper motor neuron disease 
bilateral (spastic) dysarthria, 61 
unilateral (flaccid) dysarthria, 61 
unilateral lesions and, 61 

Upper respiratory infection(s), 233 

Uvula, bifid, 53, 55f, 60, 60f 


V 
Vagus nerve. See also Cranial nerve(s), X 
anatomy of, 73, 74f 
laryngeal innervation by, 366-367, 367f 
lesions, 61 
associated signs, 72t 
diseases causing, 77 
effects on resonation, 72t, 75f-76f 
effects on vocal fold movement and phonation, 72t, 
75-79, 75f-76f 
infants with, 73 
and pyramidal tract lesions, combined, 86-87 
voice disorders caused by, 73-84 
pharyngeal nerve branch, 367 
recurrent laryngeal nerve branch, 367 
superior laryngeal nerve branch, 367 
Van der Woude syndrome, 54 
Variability, of voice parameters, definition of, 2 


Velar fricative, 57f, 58 
Velocardiofacial syndrome, 26, 48, 54, 60 
genetics of, 324 
Velopharyngeal insufficiency, 56 
acoustic studies of, 50 
causes of, 42 
congenital, 59-60, 60f 
neurologic, 61-62 
nonorganic, 61 
psychogenic, 62 
treatment of, 47-48 
visualization, in nasal endoscopy, 47-48 
and vocal nodules, 175 
Velopharyngeal mechanism 
acoustic studies of, 50 
anatomy of, 42-50 
bony, 44-45 
surface, 42-44, 43f 
biomechanics of, 45f, 46-50 
blood supply to, 45 
closure, biomechanics of, 46-47, 47f 
dynamic magnetic resonance imaging of, 48 
electromyographic studies of, 48-49, 48f 
muscles of, 45-46 
nasal airflow studies of, 49-50, 49f 
nasal endoscopy of, 47-48 
opening, biomechanics of, 46 
physiology of, 42-50 
trauma to, 60 
videofluoroscopic studies of, 46-47, 47f 
Velopharynx, functions of, 39 
Velum, 42, 43, 45f. See also Soft palate 
movement, in speech, 49 
Ventricular dysphonia, 178 
Ventricular phonation, 264 
Vibrato, 272t, 274, 277 
Vibratory amplitude, videostroboscopy of, 155 
Videofluoroscopy, 46-47, 47f 
Videokymography, 155-156, 156f, 157f 
quantitative image analysis, 157-160 
Videostroboscopy, laryngeal, 154-155 
quantitative image analysis, 157-160 
rating system for, 157, 158f-159f 
visual perceptual rating of, 157, 158f-159f 
Virilization, voice changes in, 32-33 
Visceral pleura, 322, 324f 
Visual analog scales, 138, 138f 
Visual perceptual rating, of laryngeal images, 157, 158f, 159f 
Vital capacity, 147-148, 339, 339f 
in males, 15 
Vitamin A deficiency, and tracheal or laryngeal cartilage 
malformations, 25-26 
Vocal abuse, 173-178, 233 
age and, 173-174 
in childhood, 7, 173-174, 247-248 
epidemiology of, 7 
etiology of, 173-174 
and laryngitis, 28 
in males versus females, 173-174 
in older adults, 18 
treatment of, 245-247 
in adults, 249-250 
and vocal nodules, 173, 175-176, 247-249 
Vocal accumulator, 241-242 
Vocal attack 
breathy, 368 
even/static, 368 
hard/glottal/stopped, 368 
modes of, 368 
Vocal cord dysfunction (VCD), 78 
Vocal efficiency, 150 
Vocal fatigue, 245-247 
and laryngitis, 28 


Index l-17 





pathophysiology of, 246 
symptoms of, 246 
Vocal fold body-cover, 365f, 366 
Vocal folds 
age-related changes in, 17-18 
anatomy of, 1 
benign lesions of, differential diagnosis of, 247t 
bowing of, 35 
cartilaginous portion of, histology of, 364, 365f 
discoloration of, in older adults, 17-18 
embryology of, 320, 323 
epithelium of, 365, 365f 
false, 364 
functions of, in human ancestors, 310 
healing, voice rest and, 239 
hyperadduction of, 173, 232 
and contact ulcers, 177 
hypoadduction of, 232 
lamina propria of, 11-12, 29, 365, 365f 
deep layer, 365-366, 365f 
intermediate layer, 365-366, 365f 
superficial layer, 365, 365f 
malignant lesions of, differential diagnosis of, 247t 
mass lesions of, 205, 205f 
voice disorders caused by, 135 
treatment of, 263-264 
membranous 
histology of, 364, 364f 
sex differences in, 15 
microstructure of, 364-366 
mucosa 
development of, 11 
vibratory function of, 374 
paralysis 
bilateral, 205, 206f 
treatment of, 260-262, 265 
etiology of, 81 
idiopathic, 79 
incidence of, 7 
in infants, 73 
producing flaccid dysphonia, etiology of, 81 
and risk of psychiatric problems, 169 
treatment of, 260-262, 265 
case studies, 261-262 
unilateral, 205, 206f 
treatment of, 260-262, 265 
vagal lesions and, 77 
voice changes caused by, 77 
structural changes of, pathological, 24 
true, 364 
ultrastructure of, development of, 323 
vagus nerve lesions and, 72t 
vibration 
body-cover theory of, 374 
vertical phase difference in, 374 
vibratory patterns, studies of, 153-156, 154 
Vocal function exercises, 242 
for vocal fatigue, 246 
Vocal hygiene, 241 
Vocalis muscle, 365-366, 365f 
Vocalization(s) 
cellular and extracellular damage during, 1 
and coordinated socialization in primates, 307 
differentiated, for communication in primates, 306-307 
in infancy 
acoustic features of, 13 
classification of, 14 
duration of, 13 
negative, 14 
positive, 14 
primate, 306-308, 369 
Vocal nodules, 29, 30f, 174-176, 205, 205f 
acute, 174 


in adults, 175, 249-250 
case studies, 175-176, 209-210 
in children, 174-175, 247-249 
dysphonia caused by, 174-175 
formation of, stages of, 174 
incidence of, 7 
microwebs and, 26 
occupations at risk for, 8 
personality factors and, 169, 249 
vocal abuse and, 173, 175-176, 247-249 
Vocal reeducation, 248, 249 
Vocal registers, 375 
Vocal sulci, 26 
Voice 
abnormal, 3-4 
as disorder of communication, 4 
incidence of, 7-8 
as sign of illness, 4 
as symptom of illness, 4 
terminology for, 2, 3 
clinical evaluation of, 134-165 
confidential, 241 
definition of, 2 
development of, 10-23 
in adolescence, 15-16 
in childhood, 14-15 
in Down syndrome, 28 
at puberty, 15-16 
functions of, 194 
as indicator of discrete emotions, 168-169 
as indicator of life stress, 170-188 
as indicator of normal personality, 167-168 
as indicator of psychopathology, 169-170 
musical, frequency range, 15, 16f 
normal, criteria for, 2-3 
percepts, and personality, 167 
personal significance of, 4, 194, 195-196 
physiologic, frequency range, 15, 16f 
psychotherapist’s use of, to increase therapeutic efficacy, 169 
resonant, 236-237 
social significance of, 4 
socioeconomic significance of, 4 
sound changes, perceptual studies of, 18-20 
sublinguistic functions of, 2 
use, history-taking about, 136 
Voice abuse. See Vocal abuse 
Voice breaks, in adolescents, 15, 184 
Voice cluster(s), 204-229 
breathy whispered intermittent group, 225-227 
case study, 225-227 
differential diagnosis of, 225f 
case studies, 204 
definition of, 204 
high-pitched group 
case study, 228-229 
differential diagnosis of, 229f 
husky breathy whispered continuous group, 205-211, 
205f-207f 
case studies, 207-211 
low-pitched hoarse group, 227-228 
case study, 227-228 
differential diagnosis of, 227f 
strained hoarse continuous group, 211-213 
case study, 211-213 
differential diagnosis of, 213, 213f 
strained hoarse voice arrest intermittent arrhythmic group, 
213-221 
case studies, 213-221 
differential diagnosis of, 214f 
strained hoarse voice arrest intermittent rhythmic group, 
221-223 
case study, 221-223, 223f 
differential diagnosis of, 222f 
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Voice cluster(s) (Cont.) 
types of, 204 
voice tremor group, 223-225 
case study, 223-225 
differential diagnosis of, 224f 
Voice disorder(s), 5-8 
aggravating factors, 233 
associated factors, history-taking about, 135 
Voice disorder(s) (Cont.) 
behavioral. See Behavioral voice disorder(s) 
classification of, 5 
by etiology, 5 
kinesiology, 5 
perceptual, 5 
with cleft palate, 59 
congenital, 6t 
conversion. See Conversion voice disorders 
course of, history-taking about, 135 
epidemiology of, 7-8 
etiology of, determination of, 4 
functional. See also Psychogenic voice disorder(s) 
psychiatric characteristics of patients with, 183t 
hyperfunctional, 232 
hypofunctional, 232 
incidence of, 7-8 
lesions causing, 233 
medical diagnosis of, 4 
multifactorial model of, 232 
of multiple etiology, 7 
musculoskeletal, course of, 135 
nature of, history-taking about, 135 
neurogenic, onset of, 135 
neurologic. See Neurologic voice disorder(s) 
nonorganic, 6, 166-167, 171, 205-206. See also Functional voice 
disorder(s); Psychogenic voice disorder(s) 
case study, 210-211 
in older adults, 18 
onset of 
history-taking about, 135 
significant factors/events associated with, 135 
organic, 205-206, 232 
course of, 135 
definition of, 6 
etiology of, 6, 6t 
patient’s overconcern about, 242 
personality factors and, 169 
precipitating factors, 233 
psychogenic. See Psychogenic voice disorder(s) 
psychologic effects of, case study, 207-208 
psychopathology commonly seen in, 
199-203 
psychosocial effects of 
assessment of, 136-137, 136f 
case study, 195-196 
psychosomatic, 171 
rebound psychologic effects of, 188 
related health and medical issues, history-taking about, 
135-136 
of structural origin, 24-36 
in adulthood, 28-36 
case studies, 31-32 
in childhood, 24-28 
Voice Handicap Index (VHI), 5, 136, 136f 
Voice Intensity Controller (VIC), 241 
Voice onset time (VOT), menstrual cycle and, 33 
Voice Outcome Survey (VOS), 137 
Voice production 
aerodynamic theory of, 364-366, 365f-366f 
age-related changes in, 18 
myoelastic-aerodynamic theory of, 357f, 371-374, 
371f-373f 
neurochronaxic theory of, 364 


Voice quality 
changes at puberty, 16 
listeners’ ratings of, reliability/variability of, 3 
modification, in management of adductor spasmodic 
dysphonia, 119 
Voice range profile (VRP), 144, 144f 
of professional singer, 279-280 
Voice-Related Quality of Life (V-RQOL), 136-137, 137f 
Voice rest, 238-241 
duration of, 240-241 
iatrogenic, 188-190, 241 
case study, 189-190 
indications for, 239 
postoperative, 239 
stages of, 239-240 
summary of, 240-241 
for vocal fatigue, 246 
Voice therapy, 231 
for actors, 264 
for adductor spastic dysphonia, 265 
benefits of, 233 
for conversion voice disorders, 251 
direct, 252 
for adductor spasmodic dysphonia, 119 
as facilitation technique, 238 
efficacy of, 265 
feedback devices in, 241-242 
general, 233-242 
indirect, 238 
individualization of, 232 
literature review of, 233 
for musculoskeletal tension disorders, 233, 244-245, 265 
outcomes with, 233 
in parkinsonism, 89 
principles of, 231-233, 265 
resonant, 236-237 
success of, 233 
symptomatic, 251 
for adductor spasmodic dysphonia, 118 
techniques for, classification of, 238 
for transsexuals, 264-265 
trial and error in, 237-238 
for vocal abuse, 233, 245-247, 265 
for vocal fold paralysis, 261 
for vocal nodules in children, 248-249 
Voice tremor, 223-225 
case study, 223-225 
differential diagnosis of, 224f 
organic (essential). See Organic (essential) voice tremor 
Volume 
definition of, 147 
measurement of, 147-148 
Vowel prolongation structured sentence production test, 118 
Vowels 
nasal resonation of, 39 
production of, in infancy, 10 
words beginning with, production of, in voice therapy, 250 
VRP. See Voice range profile (VRP) 


Ww 
Wail(s), in infancy, 14 
harsh, 14 
soft, 14 
Wallenberg’s syndrome, 81 
Wernicke’s encephalopathy, akinetic mutism caused by, 95 
Whispering 
in conversion aphonia, 179-180 
laryngeal muscle function in, 172 
Writer’s cramp, 115 


Y 
Yawn-sigh technique, in voice therapy, 236 
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